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Preface 



In September, 1980, the Office of Naval Research and Air Force 
Office of Scientific Research jointly sponsored the Lake Wilderness 
Conference on Attention. The conference was hosted by the University 
of Washington at their Lake Wilderness facility outside of Seattle. 
This technical report includes papers by the eleven speakers at that 
conference. In some cases the papers are revised versions of the 
actual transcripts of the talks. In other cases, the authors have 
chosen to entirely rewrite their presentations. We have reproduced 
the papers with a minimum of editing and retyping in order to shorten 
^ the period between submission and circulation. 
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Attention to Action: Willed and Automatic Control of Behavior 
Donald A. Norman and Tim Shall ice 



Abstract 

The major theme of the paper is that the prinary role of attention 
is in the control of action. The basic idea is that human action 
sequences can run themselves off, efficiently, smoothly, without any 
need for deliberate attention. However, when modifications in a plan 
must be made, or when it is desired that some novel alternative action 
sequence be followed, or when it is desired to prevent some habitual act 
from occurring, then it is necessary fox deliberate atrentional inter- 
vention into the process. 

We argue that most attentional conflicts occur with the initiation 
rather than the execution of actions. We suggest two levels of control: 
a contention scheduling mechanism that selects from among competing 
schemas; a supervisory attentional mechanism that biases the selection 
process. We propose that the supervisory attentional system is required 
where the action sequences are ill-learned or novel, where the action is 
highly critical or dangerous, or where planning is required. In other 
cases, selection is by contention scheduling alone. The result is three 
modes of the control of performance: automatic, contention scheduling 
without deliberate direction, and deliberate conscious control. Will 
becomes the application of attentional resources to the control of 
action. 
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Attention to Action: Willed and Automatic Concrol of Behavior* 

Donald A. Norman 
University of California, San Diego 
La Jolla, California 92093 

Tim Shal lice 
Medical Research Council 
Applied Psychology Unit 
Cambridge , England 



During the performance of a compl x action sequence, many different 
action components are likely to be active at any moment. This results 
f^om the fact that any particular action sequence is apt to be comprised 
of numerous components that are to be performed at different times. 
Moreover, in the conduct of their normal everyday activities, people 
often interweave a number of action sequences, doing several activities 
during overlapping time periods. Thus, an activity suctv as writing a 
letter can occupy considerable duration, ahd it is performed while 
engaged in other activities — listening to music, conversation, eating, 
or drinking. The writing of the letter itself has Inany different levels 
of operation, ranging from organizational aspects to the detailed motor 
movements that cause the appropriate marks to appear on the paper. 

The initiation of any individual action component can be relatively 
straight-forward; do the action as soon as the appropriate triggering 
conditions occur. Complications occur in setting up the appropriate 
conditions and analyses. Complications also occur when numerous action 
components compete for overlapping use of some of some limited resource 
or for related structures, or when the processing structures and condi- 
tions are not set up sufficiently precisely that they will perform prop- 
erly without some other level of monitoring and control. In this paper 
we propose a mechanism that allows for several control structures to 
interact in order to achieve smooth non-conflicting operation of the 
numerous action components that might be simultaneously awaiting their 
turn for action. We consider separately several different aspects of 
the situation: first, the nature of the knowledge structures that con- 
trol actions; then what we call the "horizontal threads" that specify 

*Research support to D.A. Norman was provided by the Office of Naval 
Research under contract N00014-79-C-0323. The collaboration was made 
possible by a grant from the Sloan Foundation to the Program in Cogni- 
tive Science at UCSD. Support was also provided by grant MH-15828 from 
the National Institute of Mental Health to the Center for Human Informa- 
tion Processing. We thank members of the "Skills" group of the Cogni- 
tive Science Laboratory at UCSD, especially David Rumelhart, Geoffrey 
Hinton, Wynne Lee, Jonathan Grudin, and Bernie Baars. We appreciate 
thoughtful reviews and comments by Roy D'Andrade, Steve Keele, John 
Long, George Mandler, and Peter McLeod. Approved for public release; 
distribution unlimited. 
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attentional control; and finally, a mechanism for conflict resolution. 

The Structure of the Paper 

Our goal in this paper is to provide an account for both the 
experiential and the experimental phenomena of attention. We do this by 
examining what is known cf the phenomena of attention and proposing a 
theoretical framework. The framework is structured around the notion of 
a set of active schemas, organized according to the particular action 
sequences of which they are a part, awaiting the appropriate set of con- 
ditions so that they can become selected to control action. The 
analysis is therefore centered around actions, primarily external 
actions, but the same principles apply to internal actions — actions 
th^t involve only the cognitive processing mechanisms. The organization 
of this paper is first to specify the theoretical framework that will 
guide the later analysis, then to examine some of the phenomena of 
attention, first the experimental, then experiential, and finally, the 
neuropsychological. But before we start the theoretical framework, a 
brief review of some of the experiential phenomenology that surrounds 
the use of the term "automatic" is appropriate, for this conception 
plays a major role in the development of our ideas. 

Automatic Performance 



The term "automatic 11 is one of the more ubiquitous in the 
phenomenology of attention. However, the term has a number of dii V 
ferent, though related, meanings. Experiential ly, there are at least^ 
three different meanings to the term. First, there is the way that cer- 
tain tasks can be executed, without awareness of their performance (as 
in walking along a short stretch of flat safe ground). Second, actions 
may be both initiated and performed without deliberate attention or 
awareness (as in the automatic brushing away of an insect from one's 
arm). Third are cases like the orienting response, in which attention 
is drawn "automatically" to something, with no deliberate control over 
the direction of attention. 

In addition, there are cases in which one can be passively aware of 
performing the actions, but without placing deliberate attention to 
them, and without any attempt to control them; an example of this latter 
type occurs in the performance of a skilled athletic task, where one 
might consciously be attending to the opponent, but be fully aware of 
the "automatic" hitting of the ball. Finally, within contemporary cog- 
nitive psychology, the term "automatic" is often defined operationally 
to refer to situations in which a task is performed without interfering 
with other tasks. In this situation, automatic is defined to mean that 
the task is performed without the need for limited processing resources 
(Shiffrin & Schneider, 1977). 

The different uses of the fenn "automatic" require different expla- 
nations. We return to this point after we have outlined the theoretical 
framework. Basically, however, the model that ve propose presumes that 
many action sequences are performed without any need for conscious 
awareness or attentional resources. It is only during the initiation or 
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termination of sequences that attention is apt to be required, and then 
only with ill determined, or poorly learned tasks, or when the situation 
is determined to be critical or dang erous. The different senses of the 
term reflect different aspects of the mechanism, resulting from whether 
the control is entirely without attentional resources, or requires 
supervisory control, or attentional monitoring. 

Relationship to Previous Work 

The theoretical ideas de/eloped in this model are consistent with a 
number of developments in the psychological literature on attention and 
the control of action. The emphasis that attentional limitations will 
Have their major effect at the action end of analysis, with 'considerable 
parallel and non-conflicting processing prior to the initiation of 
action, is related to work of Keele (see the chapter by Keele & Neill, 
1978). The basic notion that attentional processes play an overseeing 
role, activating whatever processing component is in need of supervisory 
assistance, has been suggested by LaBerge (1975), LaBerge and Samuels 
(1974); and Klein (1976). It is related to Posner's views of atten- 
tional biases providing costs and benefits in the production of 
responses (Posner, 1978). Our resource notions originate with Kahneman 
(1973), elaborated by Norman and Bobrow (1975) and Navon and Gopher 
(1979). Shallice's earlier work on the role of consciousness and action 
systems (Shallice, 1972, 1978), Norman (1981) on schemas and control 
structures, and RumelhaLt and Norman (Note 3) on typing have played 
major roles in the theory that we have developed. The notion of schema 
has, of course, been around for a while, being introduced for motor 
actions by Bartlett (1932) and used for this purpose by Schmidt (1975). 
A more complete view of the views of schemas consistent with our usage 
is presented by Rumelhart and Ortony (1977). 

A Theoretical Framework for the Analysis of Attention 



Schemas and Processing Structures 

Action sequences are complex ensembles of coordinated motor 
responses, oftentimes requiring some mental computation and decision 
making and considerable use of knowledge from the memory systems. We 
assume that specification of the components of actions and processing is 
don~ by means of numerous memory schemas, some organized into hierarchi- 
cal or sequential patterns, others in heterarchical or independent 
parallel (but cooperating) patterns. Any given action sequence that has 
been well-learned is represented by an organized set of schemas, with 
one — the source schema — serving as the highest order control. The 
term "source 11 is chosen to indicate that the other component-schemas of 
an action sequence can be activated via the source. The procedural 
aspects of schemas require processing structures that carry out the 
operations s>ecified, resulting in actions either upon an internal data 
base or upon the outside environment via the limbs and speech organs. 

Conflicts in action sequences can arise for numerous reasons: 
several actions might require incompatible use of the same processing 
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structures (as in simultaneously attempting to raise and to lower the 
hand); an action might require a difficult and unpracticed use of 
related structures (as in reaching for an object while making a precise 
movement of the leg); the action could require resources in excess of 
the capacity of a particular structure (as when attempting to do complex 
mental arithmetic while also retaining some items in snort-term memory); 
or the result of one activity might preclude successful completion of 
another (as when eating dinner at location A precludes eating dint^r at 
location B). Part of the difficulty in selection and scheduling of 
action components is to avoid incompatible or conflicting use of pro- 
cessing structures and to prevent the joint occurrence of other competi- 
tive activity. We propose that this occurs through selection, competi- 
tion, and negotiation among schemas. 

In the model there are three different states of a schema: dormant , 
activated , and selected . The state of dormancy is the normal, neutral 
state of the schema: a schema is dormant when it resides within the 
permanent memory structure, playing no role iu the ongoing active pro- 
cessing of the moment. A schema is activated when it is set up, brought 
to a state of readiness and given an activation value. The acti/ation 
value is determined by the combination of several factors, including rhe 
value given to it at set up by its source schema, the results of deli- 
berate attentional activation or of motivation, the influence of the 
interaction with other activated schemas, and the goodness of match of 
the conditions within the trigger data base to the trigger conditions 
specified for the individual schema which determine the conditions under \ 
which it should be invoked. A schema is selected when its activation 
value is sufficiently high to exceed its own threshold* A selected 
schema controls actions, both internal processing and external tnovenents 
of effectors. 

We now describe these aspects in more detail and introduce an 
interaction of horizontal and vertical thread structures, and scheduling 
mechanisms . 

Horizontal Threads 

Start by considering a simple, self-contained, well-learned action 
sequence, perhaps the act of depressing a response switch upon the 
flashing of a particular light. This action sequence can be represented 
by a set of component schemas, triggered by the arrival of the appropri- 
ate perceptual event and resulting in the selection of the proper body, 
arm, hand, and finger movements to depress the button. Some or all of 
this processing sequence could be set up in advance by activation of the 
appropriate source schemas which in turn activates the detailed com- 
ponent schemas for carrying through the desired sequence of action upon 
the specified flash of light. 1 Whenever the action sequence is set up, 



1. Just how much of the details of an action square can be preset is a 
point that needs to be empirically examined. The observation that the 
latency of a response is proportional to its complexity argues against 
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its representation by means of action schemas constitutes a horizontal 
thread. The important point is that the processing structure can in 
principle be well specified. 

The general nature of the processing structure for a simple action 
sequence is shown in Figure 1. The essential components are shown in 
the horizontal grouping of component schemas for an action sequence: 
this is a horizontal thread of processing structures. In this example, 
four component schemas are shown, receiving information about sensory 
and motor activity from a "trigger data base" and making use of "psycl 
logical processing structures" in transforming their outputs into 
actions. For many actions, the specific processing units involved would 
be much more complex. Thus in the skilled operation of a motor skill 
such as writing to dictation, a complex set of specific processing units 
would be involved, including storage buffers of various sorts (see 
Ellis, 1980; Morton, 1980; Wing, 1978). Moreover in such skills, the 
conceptual relatiors among processing units and the initiation and exe- 
cution of component schemas would be considerably more complex than the 
linear relation shown in Figure 1. However, we let the schematic con- 
ceptualization of the horizontal thread symbolize the specification of 
the processing structures, regardless of their actual complexity. A 
horizontal thread, therefore, stands for the specification of the com- 
ponents of the processing structures that control the over-learned 
aspects of action. 

In general, there will be a number of different action sequences 
being perforated at any given rime, each specified by its own horizontal 
thread structure, as shown in Figure 2. The operations performed by 

the component schemas that comprise the horizontal threads include 
inte v nal operations upon a memory data base, the formation and set up of 
othar schemas or processing threads, and external operations such as 
speech or movement. Different component schemas might all access a com- 
mon memory data base, and they might need to use the same processing 
mechanisms ( e.g. , the same memory structures or particular muscle 
groups). As a result, the different threads may interact with one 
another, as bymbolized by the lines in Figure 2 interconnecting the 
threads. These Interactions are important for the contention scheduling 
mechanism (to be described later). 

Schema Selection Mechanisms 

When numerous schemas are activated at the sam* time, some means 
needs to be provided for selection of a particular schema when it is 
required for its action sequence. At times, however, there will be con- 
flicts a: , ig potentially relevant schemas, and so some sort of conflict 
resolution procedure must be provided. This is a common problem in any 
information processing system where, at any one moment, several 



full, detailed specification of the motor schemas prior to the trigger 
signal (Kerr, 1975; Sternberg, Monsell, Knoll, & Wright, 1978)- Our 
concern is independent of this consideration. 
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Figure 1. A Horizontal • Thread. For well-learned, habitual tasks an 
autonomous, self-sufficient strand of processing structures and pro- 
cedures can usually carry out the required activities, without the neea 
for conscious or attentional control. Selection of component schemas is 
determined, in part, by how well the "trigger conditions" of the schema 
match the contents of the "trigger data base." Such a sequence can often 
be characterized by a (relatively) linear flow of informatics .mong the 
various psychological processing structures and knowledge schemas in- 
volved: a horizontal thread. 
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Sensory 
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Figure 2. Simultaneous Horizontal Threads. A person often performs 
several tasks at the ime time, with the individual components of each 
task either being simultaneous or overlapping in time. Moreover, any 
given task may last for a considerable amount of time. This figure 
shows 5 different horizontal threads that might be active at one time. 
Some means of selecting the individual schemas at appropriate times 
while providing some fon of conflict resolution becomes necessary. The 
interaction among the various horizontal threads needed for this purpose 
is indicted by th^ lines that interconnect schemas from different 
threads • 
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potential candidates for operation might require access to the same 
resources or might result in incompatible actions. (McDermott & Forgy, 
1978, discuss this issue for production systems and Bellman, Note 1, 
discusses the problem with respfect to animal behavior.) 

The procedure we propose is constrained by the desire to transmit 
priorities by means of the single variable of amount of activation, a 
concept consistent with current psychological theory. We propose that 
the individual component schemas of the horizontal threads each have an 
activation value that is determined by a combination of factor*,, some 
that operate among schemas, some that result from special processes that 
operate upon the schemas. 

We divide the activational influences upon a sciitma into three 
sets: vertical thread influences, contention scheduling influences, and 
trigger condition influences. Horizontal threads determine the organi- 
sational structure of the schemas and processing mechanisms for a par- 
ticular action sequence. Vertical threads determine biases acting upon 
the selection process. Trigger conditions determine the appropriate 
timing for the initiation of schemas, and the contention scheduling 
mechanism combines these various influences and selects between candi- 
date schemas where appropriate. 

Vertica l Threads 

The horizontal thread specifies the organizational structure for 
the desired action sequence. However, a scheme may not be available 
that can achieve control of the desired behavior, especially when the 
task is novel or complex. In these cases, some additional control 
structure is required. The vertical thread influences provide one 
source of control upon the selection of schomas, operating entirely 
through the application of activation values to the schemas that can 
bias theii selection by the contention scheduling mechanisms. There are 
two major vertical factors: motivational variables and attentional con- 
trol. It is this latter factor that Is the focus of this p«per. The 
overall system is shown in Figure 3. 

At tentional resources . Deliberate attentional control is the most 
important of the vertical thread fences. Here we postulate that a 
supervisory attentional mechanism is capable of monitoring the overall 
activity, then of supplying an Increase or decrease in the activation 
valaes of the relevant schemas. Note that this is an indirect means of 
control of action. Attentional control is directed only at activation 
value, not directly at the selection. Moreover, it is control overlaid 
on top of the horizontal thread organization. When attentional activa- 
tion of a schema ceases, the activational value will revert to its nor- 
mal value. 

Allport (1980) has criticized a wide range of attention theories 
for succumbing to what he calls the "GPLCCP" belief in a "General- 
Purpose Limited Capacity Central Processor. 11 We agree with much of his 
criticism, and our proposal is meant, in pr rt , to overcome these 
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Figure 3. Vertical Threads. When attention to particular tasks is 
squired, either because the components of the relevant horizontal 
threads are not sufficiently well specified or because some critical or 
dangerous situation is involved, then vertical thread activation comes 
into play. Attention operates upon schemas only through manipulation of 
activation values, increasing the values for desired schemas, decreasing 
(inhibiting) the values for undesired ones. Thus, at tentional processes 
oversee and bias ongoing action by alteration of activation values. 
Motivational variables are assumed to play a similar role in the control 
of activation, but working over longer time periods. 
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criticisms- The horizontal processing threads represent particular 
strategies for performing tasks, making use of whatever processing 
structures seem needed. Several such threads may operate simultaneously 
provided that no related processing structures are simultaneously 
required and provided all the schemas involved are well-learned. Hor- 
izontal thread control is not subject to Central Processor limitations. 

The model does contain a general purpose limited capacity mechan- 
ism, a supervisory attentional mechanism whose influence is felc by 
threading its way vertically across the active schemas. Mechanisms that 
• are concerned with planning or monitoring of actions as contrasted T ith 
the detailed execution of the task solution can play important roles in 
overseeing the satisfactory operation of complex systems. Thus, such 
mechanisms have been incorporated into a number of models of problem 
solving programs (Boden, 1977; Fahlman, 1974; Sussman, 1975). In the 
present model, this mechanism only serves as a mediating influence; it 
can only modulate the flow of processing. Oftentimes, this modulation 
is critical for the successful operation, and whenever this is the case, 
central processing limitations can occur. But we presume the whole 
action system refines itself through experience, developing and adjust- 
ing the horizontal thread structures to minimize the need for central, 
vertical thread modulation. 

Mot ivation . A second vertical thread component results from the 
effects of motivational factors. We take this to be a relatively slowly 
acting system, working primarily to bias the operation of the horizontal 
thread structures towards the leng-term goals of the organism by 
activating source schemas (and through their selection, component sche- 
mas). Memory organizational procedures, for both storage and retrieval, 
are themselves horizontal thread structures, and so are susceptible to 
motivational biases. 

Contentio n Schedul ing 

Simultaneously performed actions are sometimes in conflict with cae 
another, but at other times can be jointly performed by cooperative 
action. In the case of conflict, when one task is started, something 
must prevent simul taneous performance of the other. But with coopera- 
tive tasks, the situation is quite different. Oftertimes the lower- 
level activities required for a single, higher level task are both 
cooperative and in conflict. To see this, consider how a skilled typist 
types the word "very 11 (using a standard typewriter keyboard): the posi- 
tioning ot the hands and fingers is both cooperative and competitive. 
The typing of the V, "e", and "y" is cooperative; as the left index 
finger positions itself to type the V, the middle finger of the left 
hand can start its movement towards the "e" and the right index finger 
can posit ioa itself for the "y' 1 * The hands and arm position themselves 
so as lo assist in the finger movements. In contrast, the typing of the 
"v" and the "r" is competitive, both requiring conflicting use of the 
bame finger. Analyses of high speed moving films of skilled typists 
indicates that both competitive and cooperative interactions occur 
(Gent nor, Grudin, & Conway, Note 2). 

17 
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To permit simultaneous action of cooperative acts and prevent 
simultaneous action of conflicting ones is a difficult job, for often 
the details of how the particular actions are performed determine 
whether or not they conflict with one another. Thus, whether or not 
several objects can be picked up at one time with the same hand is 
determined by the shape and location of the objects, the exact use that 
is made of the fingers, and the skill and experience of the person. 
Some sort of conflict resolution mechanism Jiust come I Tito play to 
resolve these issues. 

We propose that the scheduling of actions takes place through what 
we call "contention scheduling," in which active schemas interact with 
one another through inhibition and, to a lesser extent, excitation of 
activation values. The result is competition for selection, modulated 
by the vertical thread activations, preventing competitive use of common 
or related structures and negotiating cooperative, shared use of common 
structures or operations where that is possible. (This principle is 
used by Rumelhart & Norman, Note 3, in a model of the hand and finger 
interactions in skilled typing.) 

We assume that the initial activation values of component schemas 
are determined by means of their source schema. For example, when the 
source schema for a task such as driving an automobile has teen 
selected, all its component schemas become activated, including schemas 
for such acts as steering, stopping, accelerating, slowing, overtaking, 
and turning. Each of these component schemas acts as a source schema, 
activating its component schemas (braking, changing gear, signalling). 
The one remaining consideration for the selection process is the satis- 
faction of the contention trigger conditions. 

Trigger conditions . The determination of the activation level of a 
schema from higher-level systems is n>rmally not sufficient to provide 
adequate timing of its selection. A particular action must be done at a 
time dependent upon the occurrence of appropriate environmental events. 
We propose that proper timing of schema selection depends upon the 
satisfaction of trigger conditions that specify the exact conditions 
under which selection is appropriate. Trigger conditions then contri- 
bute to the overall activation of the schema. How well the existing 
conditions match those trigger specifications determines the amount of 
activation contributed by this factor. (Attentional or motivational 
influences on activation value can override the effect of the trigger 
conditions, causing selection even in the absence of appropriate 
triggering conditions, or inhibiting selection even when there is per- 
fect match of triggering conditions.) 

The selection mechanism . There are two basic principles of the 
contention scheduling mechanism: first, the sets of potential source 
schemas compete with one another in the determination of their activa- 
tion values; second, the selection takes place on the basis of activa- 
tion value alone — a schema is selected whenever its activation exceeds 
a threshold value. The threshold can be specific to the schema, and 
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coeld become lower with use of the schema. 

The competition is effected through lateral activation and inhibi- 
tion among activated schemas. What degree of lateral inhibition exists 
between schemas on the model remains an open issue, Schemas which 
require the use of any common processing structures will clearly need to 
inhibit each other. Yet the degree of inhibition cannot be determined 
simply a priori. Thus some aspects of the standard refractory period 
phenomena can be plausibly attributed to such inhibition between sche- 
mas; explanations based upon conflicts in reponse selection fit the data 
weJl (Kahneman, 1 973) • However > one cannot just assume that responses 
by each of the two hands inevitably involve a common processing struc- 
ture, as refractory period effects can disappear if highly compatible 
tasks are used (Greenwald & Shulman, 1973). On the model, as a task 
becomes bet ter learned, the schemas con troll ing it could become more 
specialized in their use of processing structures, reducing potential 
structural interference and minimizing the need for mutual inhibition 
among schemas. At the same time, a factor that may operate to broaden 
lateral inhibitory interactions among schemas is the possibility of 
interactions among anatomically related subsystems, even when they are 
functionally distinct (Kinsbourne & Hicks, 1978). 

Simultaneous selection of two schemas is unlikely for two reasons. 
First, if the two are at all incompatible, lateral inhibitory processes 
will tend to reduce the chance of at least one reaching threshold, as 
such effects magnify existing activation differences. The decrements 
will be greater the less well learned and, hence, the less specific the 
schemas are. Second, if deliberate attention is required to activate 
them sufficiently for selection, there will be competition for the lim- 
ited resources of the supervisory attentional mechanism. This will be 
especially the case when tasks are not well learned. An ill-learned 
schema is itself a poorly integrated fcroup 0 f element schemas, and the 
supervisory attention mechanism will nead to boost all the elements 
rather than the single whole. 2 

Note that the operation of a selected schema continues, unless 
actively switched off, regardless of what value its activation may have 
fallen to, until it has satisfied its goal, completed its operations, or 
until it is blocked when some resource or information is either lacking 



2. Although simultaneous selection of two schemas is unlikely, simul- 
taneous and synchronized performance of two action sequences is quite 
possible. If a person needs to perform two action sequences concurrent- 
ly, both of which require conscious attention, then the simplest way to 
accomplish this is to establish a sinele, higher-order source schema 
that oversees both actions. Then, "attention 11 mied be directed only at 
a single schema. If the underlying component schemas are automated (so 
they need not pass through contention scheduling), the result will be 
synchrony in action. Thus, in general, actions that require attentional 
control must either be performed with exact synchrony or in alternation, 
first one, then the other. 
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or is being utilized bv some more highly activated schema. The activa- 
tion value is important primarily in the selection process and when the 
selected schema must compete for shared resourcas, or in p-oviding com- 
ponent schemas with initial activation values. 

The scheduling is therefore quite simple and direct. No direct 
attentional control of selection is required (or allowed). Deliberate 
attention exerts itself indirectly through its effect on activation 
values. All the action, therefore, takes place in the determiuation of 
the activation values of the schemas. 

Fine tituing of operations . A critical component of the performance 
of many skills is fine timing of the operations* As the selection 
mechanism is now constituted, it cannot be counted upon to respond as 
precisely as is required. Even if triggering conditions were the same 
on different occasions, the contention selection mechanism would be apt 
to lead to variability in selection time, in part because it would be 
unreasonable to assume that other factors affecting activation level 
were constant. 

One possibility is for the contention scheduling mechanism to be 
used only for initial selection of schemas and for crude timing. Pre- 
cise timing would then be handled by means of specific triggaring condi- 
tions at appropriate (low) levels of component schemas. Component sche- 
mas would then specify precise triggering conditions required for the 
actions under their control so that this level of control would not be 
done by the contention scheduling mechanism. This assumption also 
allows negotiations among schemas that are simultaneously operating to 
take place at this same level, so that one schema operates in such a way 
as to allow as much as possible of the other schema to be realized. 
Thus, if both a paper-picking-up schema and a pencil-picking-up schema 
are operative, the hand and finger configuration used for picking up the 
paper is likely to be modified so as to allow for the picking up of the 
pencil v*ith the unoccupied fingers. 
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The Interpretation of the Phenomena 



A key component of our theory is the role of attention to action. 
However, many of the experiments that have been performed to examine 
attentional phenomena have concentrated upon the role of attention in 
perception. As a result, many of these experiments do not test our 
ideas directly, but rather require explanations that cut across the 
several theoretical mechanisms that we describe. 

One major class of phenomena do fit reasonably directly into the 
theoretical structure of the model. These are certain of the phenomeno- 
logical aspects of attention, which are subject to reasonable agreement 
among observers, but available only through introspection rather then 
hard experimental evidence. In this section of the paper we discuss 
both the experimental and the introspective, phenomenological evidence. 

The Experimental Literature 

According to our framework for the control of action, simultaneous 
tasks interfere with one another in one of two ways: structural 
interference, when two horizontal threads overlap in their demands for 
processing structures; and attentional or resource interference, when 
attentional capacity is required from the vertical thread. One can 
attempt to control both these forms of interference, the structural 
interference through the appropriate design of the tasks that are to be 
performed at the same time, the attentional interference through suffi- 
cient (albeit lengthy and tedious) training on the tasks so that they 
can be performed "automatically, 11 without the need for attentional bias- 
ing of the contention scheduling mechanism. 

The best way to assess these ideas would be to have experiments 
that separate cleanly the demands placed upon horizontal and vertical 
thread components. We expect attentional limitations to effect only 
vertical thread processes and structural interference to affect only 
horizontal thread influences. However, as we discussed earlier, the 
structural interference between any pair of tasks cannot readily be 
determined a priori. Thus, the existence of interference between tasks 
does not speak directly to the theory. Rather, the more important pred- 
ictions conceal the conditions for which there will be no interference 
between Msks, as this requires that there be neither structural (hor^ 
izontal) nor attentional (vertical) interference. 

Simultaneous tasks . On the model , satisfactory performance of 
several simultaneous* tasks depends upon lack of conflict of these tasks 
for any of several kinds of resources. The major prediction is that 
parallel dual-task performance should be most easily possible where only 
a single action sequence has to be initiated, that is, where only one 
schema has to be selected by contention scheduling. This fits with the 
results on nonitoring (for review see Duncan, 1980). Thus Moray and 
Fitter (1973) and Sorkin and Pohlmann (1973) showed that monitoring for 
one of several possible signals (or signals over several possible 51 
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channels) could be done satisfactorily, and that performance suffered 
only when simultaneous target detection and responding was required (in 
our terms, when there was initiation of actions). 

Effective multiple channel mcnitoring depends on the activation of 
appropriate schemas by triggers that have been set up previously. There 
can, however, be difficulties in setting up the schema trigger condi- 
tions so as to match the expected signals unambiguously. For instance, 
how novel may the set of trigger conditions be? The Shiffrin and 
Schneider experiment (1977) in which the "automatic" attention was not 
possible when the target distractor relation was varied from trial to 
trial indicates that considerable practice is required to set up some 
trigger conditions. A second problem is that activation-based processes 
are liable to result in selection induced by stimuli similar to targets. 
The experiments by Treisman and Gelade (1980) seem to indicate that some 
situations which require the integration of "separable features" (where 
the background distractors may also contain these same features) present 
special difficulties of this sort. Thus, in some cases, effective tar- 
get selection cannot be performed without internal processing actions. 
On the current model this is apt to require attentional activation of 
the processing schemas, thus forcing deliberate serial attention to be 
directed at the parts of the signal. 

Simultaneous performance of tasks whic»> require production of 
separate response streams is possible when two conditions are satisfied: 
first, the horizontal thread structures must be sufficiently developed 
that they can control the action sequences without deliberate supervi- 
sion; second, there cannot be any structural interference. The first 
condition is obviously most apt to be satisfied when there has been con- 
siderable practice at the tasks. The second condition requires minimi- 
zation of the overlap of use of common processing or control mechanisms. 
These conditions have been satisfied in a number of experiments that 
have examined performance of highly skilled, well-practiced people 
(AJlport, Antonis, & Reynolds, 1972; McLeotf, 1977; Spelke, Hirst, & 
Neisser, 1976). 

Even in these situations, performance oft^n deteriorates somewhat 
when a second cask is added, although there appears to be no obvious 
grounds for structural or attentional interference. A common explana- 
tion of this finding is that there is an excess "overhead" associated 
with the performance of two tasks rather than one, an overhead that 
leads to a performance decrement regardless of the nature of the tasks 
(e.g., Allport, 1980). The model provides a possible source for this 
extra overhead : the supervisory attentional mechanisms . Unless the 
second task is extremely well-learned, there will be a need for extra 
source and component schemas that require vertical thread activation in 
their setup and in their selection. Thus, as Allport (1980) pointed 
out, in experiments involving piano playing conducted by Allport, 
Antonis, & Reynolds (1972), the one subject who showed no interference 
"was also the most competent of our pianists." The other subjects all 
found some technical challenge in the music such that "moments of emer- 
gency occurred," where recovery required some relatively unpracticed 
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applications of keyboard technique and therefore, on our model, atten- 
tional resources* 

Attentional demands at rhe initiation of actions . A major theme of 
the model is that attention is used primarily at the initiation of 
actions. This property of the model fits with the results of probe stu- 
dies during the movement, where responses to probes at the start or end 
of movement are more delayed than those daring execution (Posner & 
Keele, 1969; Ells, 1973). Indeed under certain conditions, although the 
start of the response shows the expected interferences, the large stages 
of response execution may not delay responses to probes at all (Posner & 
Keele, 1969). In Interpreting these results It is Important to realize 
that both the starting and stopping of a response requires selection by 
contention scheduling since, at one level, they Involve different sche- 
mas. To stop a physical motor response requires that the limb motion be 
halted, which requires Initiation of the action of muscle groups that 
can counter the momentum of the movement. It is just as hard to stop a 
movement as it is to start it. The exception is when movement is 
stopped by an external "stop." In this case, one would not expect 
attentional effects at the termination of the movement which Is exactly 
what is found. 

Moreover, attentional influences act only to bias the selection 
mechanisms, not to do the selection. One relevant study was performed 
by McLean and Shulman (1978) who found that when attention was first 
directed to the possibility of a signal and then distracted, there was a 
residual bias that remained from the initial investment of attention. 
We believe this finding to be more consistent with the view that atten- 
tion can only bias processing actions than wittj the more usual view that 
at tent ion selects processing . 

A related prediction made by the model is that if the triggering 
potential of one type of stimulus Is mu<;h stronger than that of a second 
type, then the former will be selected in contention scheduling even 
though the latter is being deliberately attended to. In this situation, 
triggering activation is more powerful than activation from the super- 
visory attentional mechanisms. One set of such findings comes from the 
literature on selective attention In which an attempt is made to keep 
the subject concentrating upon a primary task while other signals are 
presented. A classic example of the difficulty of doing this is the 
Stroop phenomenon. Certain classes of words presented upon a secondary 
channel can intrude upon or bias primary task performance, such as a 
word that fits within the context of the primary channel, or that has 
been conditioned to electric shock, or that has high emotional value 
(such as one's own name)* Performance of the other task is impaired 
when the interrupt occurs. In terms of our model, these "intrusions" 
result fron data-driven entry of action schemas into the contention 
scheduling mechanism and their selection there due to the strongly 
activating properties of such triggers. These intrusions, therefore, 
nre similar to the form of action slip found in "capture errors," to be 
discussed later. 
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Experiential Phenomena 

An important aspect of attention and the control of action is its 
phenomenology. The role of the conscious awareness of directed, con- 
trolled attention to events and to actions, and the nature of those 
situations in which one can act without awareness or deliberate inten- 
tion to act, are all of direct relevance to the theoretical structures 
described here. Indeed, the theory was developed with explanation of 
the experiential aspects of attention as much of a goal as the results 
of controlled experimentation. 

Phenomenal reports of action provide evidence for qualitative dis- 
tinctions among different types of experience. We do not discuss the 
correspondence between consciousness and information processing mechan- 
isms and functions (but see Shallice, 1972, 1978). To make differentia- 
tions between types of experience, it is sufficient to assume that the 
supervisory attentional mechanism has the possibility for access to 
information about the schemas selected in contention scheduling, and 
that reflection on action involves this process. 

Automatic performance . In an earlier section we identified several 
different aspects of the term "attention." On the model, there are 
correspondences for all these related aspects of the term. Some 
actions, such as the decision to walk across a certain path, might be 
initiated with directed attention and clear awareness, but the perfor- 
mance need not depend upon the supervisory attentional mechanism nor 
interfere with any other activated schema in contention scheduling; 
hence, the automatic nature of the performance. In other actions, <=uch 
as the brushing away of an insect, even the initiating of the act 
(through data-driven excitation of a source schema) might not depend 
upon the supervisory attentional mechanism. In this case, there could 
be a complete lack of awareness of both the initiation and the action. 
These cases also correspond to the operational definition, for there is 
no demand upon the attentional resources. 

There are, however, cases in which one experiential sense of 
"automatic" does not correspond to "automatic" in the operational sense. 
Thus, the orienting response ij phenomenologically automatic, as there 
is no deliberate attentional control over schema activation and selec- 
tion. However, schema selection may very well interfere in contention 
scheduling with the operation of tasks being performed at the same time; 
in these cases, the operational criterion for automaticity is not met. 

A specially interesting case of automaticity is where at one time 
the action did require conscious direction for its performance, but now 
no longer does so. As William James (1890) pointed out at some length, 
the common denominator of such actions is that they are habitual, fre- 
quently performed, usually in a relatively fixed format. Why should the 
ability to perform such tasks automatically only occur when they are 
well learned? On the model, this is because newly learned actions ar^ 
apt to be ill-specified. Their schemas are relatively small, encompass- 
ing relatively specialized sub-actions. Moreover, their triggering 



No mar /Shall ice Attention to Action 

December 10, 1980 



conditions are apt to be ill-specified, not well matched to the actual 
conditions that occur. As a result, continual monitoring is required by 
the attentioaal mechanisms, and selection must often be forced (or 
delayed) by the application of deliberate attentional activation. 
Well-learned actions are apt to be well specified, with their schemas 
encompassing large, organized units of behavior, and with their trigger- 
ing conditions well-matched to the situation. As a result, once their 
schemas have been selected, they can maintain control effectively for 
longer periods. 

The amount of interference that a task presents to other tasks per- 
formed at the same time will also decrease as learning improves. This 
decrease comes about for three different reasons. First, as the schemas 
become better specified, there will be fewer gaps and fewer weaknesses 
that need the supervisory processes for proper control. Second, as the 
triggering conditions become better matched to the situation, they are 
less likely to need supervisory attentional resources. Third, general 
schemas are apt to make more general demands on processing structures, 
making it more likely that there will be conflicts with other action 
sequences. As the actions become specified more precisely, they involve 
a smaller fraction of the psychological processing structures. This too 
reduces the conflicts in performance, by decreasing the possibility of 
lateral inhibitory conflicts during contention scheduling. 

Contention scheduling without deliberate direction . It is possible 
ro be aware of performing an action without paying active, directed 
attention to it. This corresponds to situations in which the selection 
of a schema is accomplished by contention scheduling without the 
involvement of the supervisory attentional mechanism. The most general 
situat ion of this type is in the initiation of routine actions. 
Phenomenally, this corresponds to the state that Ach (1905) describes as 
occurring after practice in reaction time tasks. Over the first few 
trials, he said, the response is preceded by awareness that the action 
should be uade, but later there is no such awareness, except if prepara- 
tion has been inadequate. By then, the stimulus triggers the appropri- 
ate schema without the involvement of the supervisory attentional 
mechanisms. In well-learned tasks, the subject experiences the response 
as proceeding with "an awareness of determination 11 even if it is not 
immediately preceded by any experience of intention to act. In our 
terms, this is because the schema controlling action had previously been 
activated using the supervisory attentional mechanisms. but then the 
schema selection occurs automatically when the proper stimulus condi- 
tions occur. 



Whenever contention scheduling takes place without any present or 
prior involvement of the supervisory attentional mechanisms, even the 
"awareness of determinat jon" is absent. Data-driven triggers act in 
this way. Because there is no involvement of the supervisory atten- 
tional mechanism, and hence no monitoring, errors can easily occur. IWo 
such errors are the form labelled "data-driven errors" and "capture 
errors" (Reason, 1979; Norman, 1981). Both classes of errors occur when 
the schema that was intended to control action is replaced by another 
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one, leading to an unintentional result. In the case of "data-driven" 
errors, the unintended schema is activated by perceptual information (by 
newly arriving sensory data). In the case of "capture errors," the 
unintended schema shares considerable features with the intended one, 
and in addition, is the more frequently performed action sequence. In 
either case, one may find oneself doing a totally unexpected set of 
actions, nuch to one's own dismay. 

Take for example, Reason's description of the person who went to 
the garage to drive to work and found that he bad "stopped to put on my 
Wellington boots and gardening jacket as if to work in the garden" (Rea- 
son, 1979). Or take the person described by William James who went to 
the bedroom to change for dinner and ended up undressed, ready for bed. 
Students of abnormal behavior may wish to point out that there is seldom 
a single explanation for behavior, that there are many interacting 
causes. Thus, the person who found himself gardening might also have 
(subconsciously) wished to avoid going to work. So too with the person 
described by James; the dinner may have been unwelcome. Our contention 
scheduling system is deliberately designed with these issues in mind. 
We postulate that selection results from the combination of numerous 
factors. Thus, it is quite possible that the sight of the gardening 
boots or the act of undressing (to change one's clothes) would not by 
themselves have been sufficient to have selected the discrepant 
behavior. Similarly, the hidden wishes, whether conscious or not, would 
not by themselves have been sufficient to cause the behavior. But the 
fortuitous combination of the wishes and the situation were sufficient 
to cause selection of the schemas . 

Deliberate conscious contrcl . A critical separation on the model 
is between action initiated through contention scheduling without the 
involvement of the supervisory attentional mechanism, and action ini- 
tiated with the involvement of this mechanism. This distinction 
corresponds closely to William James's (1890) distinction between 
"ideo-motor" and "willed" acts. To James, "wherever movement follows 
unhesitatingly and immediately the notion of it in the mind, we have 
ideo-motor action. We are then aware of nothing between the conception 
and the execution." These "ideo-motor" actions (a category which does 
not exclude "awareness of determination") corresponds directly to our 
idea of contention scheduling without conscious direction. His concept 
of actions involving "an additional conscious element in the shape of a 
fiat, mandate, or expressed consent" corresponds to cases where we 
believe the supervisory attentional mechanism to be operative. 

Kxperient ially, a number of different sorts of tasks appear to 
require a considerable amount of deliberate attentional resources. 
These tasks fit within the following categories: 

(a) they involve planning or decision-making, 

(b) they involve components of trouble shooting, 
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(c) they are ill-learned or contain novel sequences of actions, 

(d) they are judged to be dangerous or technically difficult, 

(e) they require overcoming a strong habitual response or resist- 
ing temptation. 

The general p Inciple involved is that these are special situations in 
which the uncontrolled application of a horizontal processing thread 
through the contention scheduling mechanism is apt to lead to error. 
The supervisory attentional mechanisms allow more control over the 
sequence of actions to be performed thsn is possible through horizontal 
thread direction alone. 



Vertical thread influences are necessary because individual schemas 
are limited in what they can foresee and control- Further, although the 
contention scheduling mechanism allows selection among schemas, it does 
not allow lor integration of information across them. Given the variety 
and power of human capacities, it would appear that mechanisms must 
exist which have available to them information about the varied needs 
and capacities of the organism, including source schema, the ability to 
monitor the operation of schemas, and the power to initiate the con- 
struction of new sche aas out of existing ones. 

Planning and decision making are processes that operate in the for- 
mation of intentions that are not routine. In our terms, we plan or 
decide when it is clear that no existing schemas are sufficient to 
satisfy a particular goal. In these cases, information must be from 
more than one schema, or new schemas must be formed. This requires 
involvement of the supervisory attentional mechanisms. We assume 
though, that these general powers are bought at the cost of speed. Use 
of the supervisory attentional mechanisms then provides both benefits 
and costs. The benefits derive from the increased processing power 
brought to bear on the problem at hand; in general, judgments which it 
controls will be superior to the unguided selections of contention 
scheduling. The costs result from the slowness, leading to difficulty 
in the control of rapid, skilled actions, and to seriality in the con- 
trol of what would otherwise be parallel acts. 

We define trouble shooting to be the application of planning z u d 
decision-making processes to actions already in progress. It occurs 
when an unexpected error occurs in the operation of an action (see 
Handier, 1975). When a particular, specialized component schema has 
failed, one solution is to replace it with a more general one. More 
general schemas are apt to require selection through contention schedul- 
ing and vertical thread control by the supervisory attentional mechan- 
isms. 

The performance of ill-learned or novel skills requires what Fitts 
and Posiier (1967) called "the early or cognitive phase • in which it 
is necessary to attend to cues, events, and responses ,ut later go 
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unnoticed" (pp. 11-12). These are situations which require attentional 
control because neither appropriate schemas nor their triggers have been 
developed. With dangerous and technically difficult situations, error 
is relatively costly. In this situation, one wishes to guard against 
the vagaries of contention scheduling and enforce selection of the most 
appropriate scnema by means of strong activation. Similarly, in over- 
coming habitual responses or in resisting temptation, the appropriate 
schema must be strongly activated and the others strongly inhibited, 
else a more usual, but inappropriate act, might be selected by the nor- 
mal operation of contention schsduling. 

There ^an be costs in using deliberate control in a task that is 
normally performed automatically* Activation of schemas in contention 
scheduling by the supervisory attentional mechanism has the effecc of 
reducing the influence of activation from triggering stimuli and ottier 
schemas in the selection process: subtle environmental control is lost* 
In addition, action execution must proceed unit by unit, each schema 
awaiting its turn for receiving attentional biases. The result is a 
lack of smoothness and a slowing of performance • 

Neuropsychological Phenomena : The Frontal Lobes 




There are strong correspondences between f actions of our super- 
visory attentional mechanism and those ascribed by Luria (1966) to the 
prefrontal regions of the brain. If the supervisory attentional mechan- 
ism were damaged, the resulting behavior would be similar to the 
behavior of patients with lesions to the prefrontal regions. 

A deficit in planning corresponds to Luria's clinical characteriza- 
tion of the frontal lobe syndrome, which has been supported in a number 
of experimental studies involving maze learning, complex visual- 
constructive tasks, and complex arithmetical problem-solving (see Walsh, 
1978). The simplest example of a planning disorder is the finding of 
Gadzhiev (see Luria, 1966) that frontal patients when presented with a 
problem tend to miss out the initial assessment of the situation. Tron*- 
tal lobe patients have also been characterized clinically as having 
deficits in initiative, in dealing with novelty, and of judgment (Pen- 
field & Evans, 1935; Goldstein, 1936). 

Patients with frontal lobe lesions have difficulties with error 
correction. The Wisconsin card-sorting test involves multi-dimensional 
stimuli where the patient must switch from sorting according to one 
dimenbion to sorting according to another. In this task frontal 
patients show a strong tendency to perseverate in sorting on the previ- 
ously correct dimension, even when they are told they are wrong (Milner, 
1964; Nelson, 1976). Turret (1974) found that patients with frontal 
lobe lesions are the most Impaired group on the Stroop test. This is a 
task in which the usual response to a stimulus is not the desired one — 
habitual responses must be suppressed. In this situation deliberate 
at tent ional control is required , hut this in general presents especial 
difficulty for frontal lobe patients. 
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The failure to overcome an habitual response tendency is one side 
of the general effect that should occur on our model if the supervisory 
attentioual mechanisms are damaged* In this case, behavior will be left 
under the control only of the horizontal thread structures, plus content 
t ion scheduling . In the examples above where one schema is more 
strongly activated than the others, it will be difficult to prevent it 
from controlling behavior* By contrast, when several schemas have simi- 
lar acti\ation values one should obtain another clinical characteristic 
of frontal patients: an instability of attention and heightened dis- 
tractability (see Walsh, 1978). This apparent contradiction between 
increased perseveration and increased distractabil ity resul ts from 
failure of a single mechanism. Both results are observed in animals 
with prefrontal lesions (see Brush, Mishkin, & Rosvold, 1961; Pribram, 
1973). 

If the properties of the supervisory attentional mechanism seem to 
correspond fairly well with neuropsychological evidence , does the same 
apply to the properties of contention scheduling? One possible relation 
is between the lateral inhibitory and threshold properties of contention 
scheduling and certain properties of the basal ganglia, thought because 
of their role in the aetiology of Parkinson's disease to be involved in 
the initiation of action (see also Stein, 1978). Moreover the basal 
ganglia are innervated by dopamine systems, which it has recently been 
claimed mediate the selection of behaviors through a lateral inhibitory 
mechanism somewhat analogous to contention scheduling (see Joseph, 
Frith, & Waddington, 1979) and which when they malfunction (as in amphe- 
tamine psychosis) lead to disorders which could well be at the level of 
the selection of action (see Lynn & Robbins, 1975). 

Will 



We propose that n will n be the direction of action by direct cons- 
cious control through the supervisory attentional mechanism. This 
definition is consistent both with the popular meaning of the term and 
with the discussions of will in the earlier psychological literature. 
Thus, strongly resisting a habitual or tempting action or strongly forc- 
ing performance of an action that one is loathe to perform seems to be 
prototypical examples of the application of will. The former would 
appear to result from deliberate attentional inhibition of an action 
schena, the latter from deliberate activation. James (1890) drew the 
contrast between "what happens in deliberate action 11 where will is 
involved and actions that do not require will, where the responses fol- 
lowed "unhesitatingly and immediately the notion of it in the mind" 
(ideo-motor actions). Situations in which there is no need for will are 
tho^e where there "seems to be the absence of any conflicting idea." 

In our view, will varies along a quantitative dimension correspond- 
ing to the amount of activation or inhibition required from the super- 
visory attentional mechanisms. The assumption that this activation 
value lies on a continuum explains why the distinction between willed 
and idco-niotor actions seems quite cl ear when considering extreme 
actions, but becomes blurred when considering those that require very 
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little atrentional effort. Thus, introspection fails in determining 
whether or not will is involved in the voluntary lifting of the arm. 
But there is no need to make a distinction if this act is simply identi- 
fied as bein£ near the zero point of the quantitative scale of atten- 
cional activation. 

The idea that will corresponds to the output of the supervisory 
attentional mechanisms has certain other useful consequences. Consider 
the errors that occur with brief lapses of attention, when there is a 
failure to sustain will adequately. One type of error results following 
a decision not to do a step within a habitual sequence of actions. To 
eliminate the step requires deliberate (willful) inhibition of the 
relevant schema. If there is a momentary lapse of attention to the 
deliberate inhibition, the step may get done anyway. Closely related is 
the error that occurred to one of us. Having decided not to take 
another bite of a delicious, but extremely rich dessert, with only a 
brief lapse of attention, the cake got eaten. 

Certain aspects of will require elaboration of our approach. In 
some circumstances an action may seem to require no will at all, yet at 
other times, require extreme demands. Thus, getting out of bed in the 
morning is at times an automatic act, at other times requires great 
exertion of will. One explanation for this observation is that activa- 
tion of an action schema by the attentional mechanisms necessarily 
involves knowledge of consequences. When these are negative, they lead 
to inhibition of the source schemas which then must be overcome. In 
some cases, the self-inhibition can be so intense as to prevent or at 
least make very difficult the intended act. Thus, inflicting deliberate 
injury to oneself (as in pricking one's own finger in order to draw 
blood) is a difficult act for many people. 

The elicitation of strong activation from the supervisory atten- 
tional mechanism is not necessarily unpleasant. Indeed, many sports and 
games seen to be attractive because they do necessitate such strong 
activation. In this case "concentration" is perhaps the more appropri- 
ate experiential equivalent rather than "will." In addition, will is 
not just a matter of attention to actions. As Roy D'Andrade (personal 
conmunicat ion) has oointed out, a willed act demands not only strong 
attentional activation, it also depends on the existence of a "mandated 
decision," independent of one's attending, a conscious knowledge that 
the particular end is to be attained. This mandate, in our view, would 
be required before the supervisory attentional mechanisms will produce 
their desired activation output. However the critical point for the 
present argument is that the phenomenal distinction between willed and 
ideo-motor acts flow from separation of the supervisory attentional 
mechanisms from the systems they oversee. The phenomenology of atten- 
tion can be understood through a theory of mechanism. 
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Summary 

W6 present a possible framework for considering the role of atten- 
tion in the control of action. In this, we have emphasized several 
things. First, because people usually do numerous activities during a 
given time period, a major concern for the control of action becomes how 
the selection of the individual component*: occurs at appropriate times, 
allowing co-operative actions to co-occur and avoiding conflicting ones. 
Second, there is the importance of the initiation of action. We assign 
the basic role of the attentional mechanisms to the initiation of action 
(as opposed to perceptual analysis, — where the bulk of the experiments 
have been performed — or to thought ar"l decision processes). By 
"action" we include the initiation of both internal processing actions 
and external control of effectors. Third, we emphasize that many 
activities can be carried out autonomously, without the need for cons- 
cious or attentional control, by means of well-specified, horizontal 
thread processing components. It is only in cases where the action 
sequences are ill-specified, or in situations that are judged to be 
critical or dangerous that deliberate attentional control is required. 
In this case, we suggest that supervisory attentional mechanisms of lim- 
ited capacity oversee the operation of the system, monitoring for the 
success of the activity, and biasing the selection and suppression of 
component schemas by altering the activation values of those schemas. 
We specify that such attentional control does not act directly, but only 
indirectly through the mediation of activation value. 



By this scheme* there are two forms of interference likely to be 
encountered in the production of simultaneous tasks. The two forms 
correspond to (a) interference among horizontal threads when they must 
compete for use of overlapping processing mechanisms and to (b) 
interference among the vertical thread activations when they must be 
produced by the supervisory attentional mechanism. The first form of 
interference — horizontal thread interference — is related to "struc- 
tural interference, 11 but on our approach this is mediated by the opera- 
tion of a mechanism, contention scheduling, that selects from poten- 
tially competing actions using only the activation levels of the schemas 
that control them. The second form of interference — vertical thread 
interference — is more a form of "resource interference." 

There are two different modes for the control of action and, like 
the distinction between forms of interference, they also correspond to 
the difference between horizontal and vertical thread control. Thus, 
when processing sequences are sufficiently well specified that they can 
be controlled entirely by horizontal thread operations, they correspond 
to "automatic" actions. When conditions do not permit unsupervised hor- 
izontal control (or when the person deliberately invokes attentional 
processes to the action sequence), then the operations correspond to 
processing under "conscious control" or "willed" action. 
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MULTIPLE RESOURCES IN TIME-SHARING 
Chris Wickens 
University of Illinois 

Recent research has addressed the question of whether human attention, 
or information processing resources can be effectively modelled as a multi- 
dimensional commodity (e.g., Navon & Gopher, 1979; Wickens, 1980). The heuris- 
tic model that we have proposed, as a framework for hypothesis formulation and 
test of the multiple resource concept suggests that processing resources are 
defined by processing stages at two levels (perceptual/central processing 
versus response processes) , processing codes (verbal versus spatial) , and 
processing modalities (auditory versus visual input, vocal versus manual 
response) . 

If idated, this configuration has both theoretical implications and 
practical implications for systems design and assessment. These will be ad- 
dressed in turn, followed by a discussion of validating experimental data 
collected in our laboratories. 

Theoretical Implications 

If separate resources do underlie processing, tnen two implications fol- 
low directly: (1) Two tasks will be more efficiently time-shared to the extent 
that they demand non-overlapping resource "pools. 11 Whether perfect time- 
sharing will result if the two pools are completely disjunctive, depends upon 
the demand (task difficulty) of the component tasks. (2) When the difficulty 
of one member of a time-jhared pair is increased, its effect upon performance 
of the concurrent task will be a function of two factors: (a) the extent to 
which the subject treats the manipulated task as "primary" (tries to maintain 
constant performance despite the demand increase) ; (b) the extent to which the 
resources consumed by the demand increase sure also those deployed in perfor- 
mance of the concurrent task. (Note that the effects of difficulty may be used 
interchangeably with the effects of automation or practice, since this variable 
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may also have its influence on separate capacity pools. That is, automation 
of perceptual processing is functionally equivalent to a decrease in percep- 
tual difficulty. Development of a motor program is equivalent to a decrease 
in response difficulty.) 

While the two assertions above represent a skeletal statement of the 
elements of multiple resource theory, it is necessary also to consider in 
more detail the defining properties of a "resource" (in a time-sharing con- 
text) , and to emphasize the distinctions that can be drawn between a multiple 
resource viewpoint, and the conception of a single resource with structural 
interference (Kahneman, 1973) . A resource here refers to any commodity that 
can be divided or shared between tasks, and whose division can be modulated 
in continuous fashion, according to an allocation policy. In contrast, 
a dedicated processing structure must service only one task at a time, and 
therefore behaves in a two-state all-or-none fashion. (This contrast leaves 
ambiguous, the designation of a multiplexing system that can rapidly switch 
between two tasks, and whose dwell time can be adjusted in continuous analog 
fashion. Obviously, at a high enough switching frequency the dedicated pro- 
cessing mechanism becomes empirically indistinguishable from the resource) . 
At issue then is whether there exists only one commodity with resource- like 
properties, or a number of such commodities. The POC (performance operating 
characteristic) methodology (Ncrman & Bobrow, 1975 * Navon & Gopher, 1979) has 
provided a convenient framework for comparing these hypotheses. 

If a multiple resource framework is adopted (as I have done here) , it is 
necessary to specify the composition, or functionally defining properties of 
the resource pools, and the interrelationships between them. Assuning as I 
have done that resources may be defined by stages, codes, and modalities 
(Figure 1), (Wickens, 1980), three alternative conceptions are possible. 
(1) An independenc e conception argues that the resources within each cell of 
the matrix in Figure 1 are disjoint and independent from the resources in all 
other cells. This assunption seems to be clearly untenable, as for example 
numerous experimental findings support the conclusion that auditory and visual 
processes will compete, as will spatial and verbal (albeit to a lesser extent 
than auditory- auditory, visual-visual, or spatial- spatial and verbal-verbal). 
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Thus it is necessary to asstme that some capacity is shared across the 
"pools" of Figure 1. This may be in the form of a single "general" capacity, 
equally available to all processes, or a (2) more hierarchical structure 
represented in Figure 2. Adopting a hydraulic metaphor, we may assime that 
resources above a boundary may transfer across the boundary, but those below 
may not and exclusively serve the indicated process. If this conception is 
adopted, it becomer important to specify the dominance ordering of the hier- 
archy. For example, Figure 2 suggests that each code (spatial vs. visual) 
has its own exclusive pool of auditory and visual resources. The implication 
is that an auditory spatial and an auditory verbal task will be time-shared 
as efficiently as a verbal <xu A . spatial task of opposite modalities. 

This prediction runs somewhat counter to our intuitions, which suggest 
instead (3) a kind of a "shared features" conception . Two tasks will inter- 
fere to the extent that features (dimensions of resource) are shared *>etween 
them. Maximum interference will occur when there is complete Teature overlap; 
minimum when there is none, and intermediate levels when one or two features 
are shared. Such a conception may be represented "hylraulically" in Figure 1, 
by assuming semi-permeable boundaries separating pools. In this conception 
one needs again to establish the dominance ordering of the different dimen- 
sions (the degree of permeability of the three orthogonal boundaries of 
Figure 1). In time-sharing perceptual encoding tasks, is it more important 
that modalities be different between tasks? Or processing codes? Or is there 
an interaction between the shared features such that, for example, shared 
modalities make a difference within a code, but not between? 

There exists a considerable quantity of experimental data that supports 
the separate resource-like properties of the three dimensions (Wickens, 1980). 
However, some of this data is flawed by confounds with task difficulty, 
(e.g. , an intra-modal time sharing combination may involve more difficult 
task components than a cross-modal), or peripheral interference (e.g., a 
visual-visual task pairing may induce a degree of visual scanning, not en- 
countered in a cross-modal combination, thereby biasing t J me-sharing efficiency 
in favor of the latter). Finally, studies have not extensively examined re- 
source dimensions orthogonally in pairs, or triples. 
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Practical Implications 

The multiple resources concept ha3 four important implications to human 
performance in complex settings: These relate tc workload, task integration, 
task structuring, and strategies. 

Workload . If resources are a vector quantity and mental workload is 
conceptualized as the demands imposed by tasks for the operator's resources, 
then effective workload measures must also be vectors. Alternatively, for 
workload measures that are scalars (e.g., subjective ratings, physiological 
measures) , their derivation from (or mapping into) the vector resource con- 
cept must be specified. For example, if a systems designer derives a subjec- 
tive difficulty index, is this index tapping perceptual load? Response load? 
Or come contribution of each? 

Task Integration . Most directly the structure- specific resources concept 
suggests that in high information flow environments (the overloaded ground 
controller, or jet pilot), greater time-sharing efficiency will be achieved 
when demands are shifted from over- to under-utilized resource pools. This 
option is increasingly available, as computer technology has provided great 
flexibility in display and control options (e.g., voice technology for both 
display and response capabilities). 

Task Structuring . There is good experimental evidence that certain codes 
or modes of processing are more naturally associated with each other, than 
others (Greenwald, 1971). For example the manual response system represents 
a more compatible output front spatial processing than verbal. Thus when a 
designer considers the task to which for example voice response, or auditory 
display will be assigned, part of that decision should be dictated by whether 
certain natural compatibility relations are either confirmed or violated with 
a particular selection. 

Strategies and Training . If resources meet the criteria defined above, 
then they are capable of differential allocation according to the operator's 
conscious choice. One implication of the multiple resources model — dealt 
with in some detail by Nave i and Gopher — is that the operator has consider- 
able flexibility in how resources from various pools, are allocated to various 
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tasks. For a given task pair, different allocation policies may lead to 
varying degrees of time- sharing efficiency, and only one particular choice 
may optimize performance. Correspondingly, a component task may be performed 
in various ways that will differentially load different pools (and thereby 
differentially impact performance of a concurrent task). A short-term 
memory task, for example , may be performed by emphasizing phonetic versus 
semantic codes. Signal detection may be varied by changing sensitivity or bias 
parameters. The supervision and control of dynamic systems may be exercised 
by emphasizing spatial or prepositional (verbal) logic. Recognition of the 
role of strategies in influencing dual task performance thus has bearing 
both upon training (the extent to which specific time-sharing strategies may 
be trained) and upon selection (potential differences in the availability of 
these strategies between people) . 

Experimental Results 

A large portion of our research on multiple resources has centered around 
the manual tracking paradigm. This task is advantageous for two important 
reasons: (1) It is a relatively ubiquitous paradigm that represents a major 
component of an aviator's task. Yet with only minor changes in certain experi- 
mental parameters of the task, the paradigm can be modified to simulate the 
slower dynamics of ship control, or the more complex transfer functions charac- 
teristic of chemical or process control. (2) It is a task that can provide 
a relatively continuous and rich supply of experimental data, which is able 
to indicate shifts in the subject's processing strategy. Furthermore, the 
task is complex enough that a number of different parameters, or chracteris- 
tics, can be rystematically manipulated, to impose differing loads at all 
le/els of the processing system. Our general experimental approach has been 
to use wel -trained subjects, and allow them to track under single task con- 
dition*, arid under a vai ety of dual task conditions during which the diffi- 
culty (resource demands) of tracking, and of the concurrent task, is varied. 
Variance in time-sharing efficiency induced by these manipulations is then 
employed to make inferences both about the nature of the tracking task itself 
and about the functional composition of processing resources. This "boot- 
strapping" operation is substantiated by converging data from other sources. 

The following description of experimental result* will be organized around 
our experimental manipulations (some of which were performed in more than one 
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different experiment). These concern tracking order, tracking bandwidth, 
display modality and hand assignment. 

TracKing Order . The order of a tracking task refers to the number of 
time integrations between a control input (exerted by the subject in response 
to a perceived error) and the response of the plant. Generally, first order 
(1 integral) or velocity control systems are fairly easy to control. Second 
order (2 integral) or acceleration systems are sluggish, unstable, and diffi- 
cult. Second order dynamics are characteristic of many aviation systems with 
high inertia. The second order task may be likened to balancing a ball bear- 
ing in the center of a flat tray, as an unseen force continuously tilts the 
tray off the horizontal. Generally, there are two different strategies of 
coping with second order dynamics. A "perceptual" strategy entails heavy 
anticipation, as the operator responds directly to higher derivatives (velo- 
city and acceleration) of the error signal. A "response" strategy entails the 
execution of a coordinated "bang-bang" double impulse control, to nullify an 
existing error. 

Using dual task data from two converging sources, we have found that well- 
trained operators tend to demonstrate a perceptual- loading strategy. In one 
investigation, siy subjects performed first and second order tracking while 
evoked potentials to a Bernoulli series of auditory tones were recorded 
(Figure 3) • Subjects covertly enumerated occurrences of the rarer of the 
two tone pitches. Following procedures described elsewhere (Isreal et al. 
1980 a, b) , the amplitude of the P300 component of the ERP was examined as an 
index of the perceptual/central processing load imposed by second order track- 
ing. In fact, the amplitude of P300 declined from no tracking, to first, to 
second order control, allowing us to infer that the increase in order was one 
that demanded resources from a pool utilized to generate the P300 (Figure 4) • 
Since P300 was elicited without any response requirement, it is doubtful that 
this pool was related to response processes. Reaction time, collected c^/.mr- 
rently with tracking, in a different paradigm rose in a corresponding fashion 
with tracking order, validating that the order increase did demand more re- 
sources (Figure 4, top). 

An alternative interpretation to these data is that resources are undif- 
ferentiated, that P300 would be sensitive to any demand manipulation, and 
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therefore that second order tracking does not necessarily impose a greater 
perceptual/central load but its load is unspecified. Two lines of evidence, 
however, argue against this view. One employs a different second task and 
the other a different tracking task manipulation. These shall be considered 
in turn. 

A second, control-order manipulation experiment was conducted in which 
a Sternberg memory-search task was imposed as a concurrent task (Wickens et 
al, 1980). In orthogonal manipulations, control order of tracking was adjusted, 
and the Sternberg task was rendered more difficult (prolonged) by (1) imposing 
a display mask, (2) increasing the memory set size (from two to four times), 
and (3) increasing the complexity of the response (from a single to double key 
press) . The first two of the Sternberg manipulations were intended to demand 
resources from the perceptual/central pool, and therefore to be more disrupted 
by any added perceptual load inherent in second order tracking. The double- 
response load manipulation should be sensitive to any increase in response 
demands by second order tracking. 

The results, shown in Figure 5 (top) confirm that tracking order and mani- 
pulations of perceptual and central load interact (reliably in the former easel , 
while order is additive with response load. (In fact, the total absence of 
effect in the latter case — at first surprising — is explainable when track- 
ing error is examined. This variaole increases with the double response, but 
does so to an equal extent with first and second order dynamics.) These re- 
sults then are consistent with the results of the ERP experiment in locating 
the demands of second order tracking, as performed by these subjects — at ear- 
lier processing stages. 

Tracking Bandwidth 

In a series of three experiments, we have manipulated the bandwidth, or 
upper cut off frequency of the random input forcing function which the subject 
must nullify in performing the task. Essentially, this .nanipulation varies 
the "speed stress" or the number of decisions and motor commands per unit time 
that must be initiated. Like control order, this manipulation renders the task 
subjectively more difficult, increases tracking error, and will sometimes gen- 
erate greater interference with a concurrent task. Yet its effects appear to 
be qualitatively different from control order. 
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When bandwidth was manipulated concurrently with the ERP and reaction 
time tasks as above (Isreal et al. 1980a) , the results shown in Figure 6 in- 
dicate that RT performance declined systematically (and reliably) , whereas 
P300 amplitude was unaffected. These data emphasize that P300 s selective in 
its resource sensitivity and is not affected by any difficulty manipulation 
of a concurrent task. 

In considering these results, it was hypothesized that the locus of de- 
mands imposed by the bandwidth increase was on the response pool of resources. 
However, the pattern of results of two further bandwidth manipulations suggest 
that this interpretation may not be entirely correct. 

When bandwidth was manipulated concurrently with the Sternberg task 
(Wickens et al., 1980, Figure 5, bottom), there is no suggestions of an inter- 
action with any processing stage. (The response load hypothesis would suggest 
that an interaction would be observed with the double response condition.) 
Furthermore, there is no suggestion that increased bandwidth had any effect at 
all upon reaction time, despite instructions to the subject to maintain track- 
ing as the primary task, and therefore to cast all variance due to dual task 
interference into the RT performance. 

A similar pattern of results was observed by Wickens and Harris (Wickens, 
1980) in an experiment in which bandwidth was manipulated concurrently with a 
running-memory mental arithmetic task in which subjects computed the absolute 
difference between successively presented digits. Stimuli for the latter task 
were presented in either the visual or auditory modality, and responses were 
generated either manually or vocally. The results (Figure 7) expressed in 
terms of a total dual task decrement measure indicated (a) no main effect of 
bandwidth (that is, increasing bandwidth did not increase the competition for 
resources with the arithmetic task) and (b) no interaction of bandwidth with 
input or response modality. Had the locus of bandwidth increase been at re- 
sponse, we might have anticipated a greater effect in the manual, as opposed 
to the vocal, response condition, since in the former condition the output 
channel is shared by the the two tasks. 



In fact, a continuous range of bandwidths was assayed, bandwidth being modu- 
lated upwards and downwards over time. 
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The results of the latter two studies were somewhat surprising. Collec- 
tively they suggest that the resources consigned by increasing bandwidth are 
not utilized in the Sternberg or mental arithmetic task. One possible clue 
to the interpretation is provided by the fact that both of these tasks are 
verbal in their processing requirements (encoding and searching letters of 
the alphabet, and retaining and subtracting digits respectively). The RT 
task employed in the ERP bandwidth experiment vas non-verbal, requiring only 
the pitch discrimination of two tones. This might suggest that the resources 
underlying bandwidth are spatial in nature (an increase in the rate at which 
spatial uncertainty needs to be resolved) . This consideration leads to a dis- 
cussion of two further experiments: one related to the relation between the 
spatiality of tracking and processing modalities, and the second to the rela- 
tion between spatiality and hand of control. 

Display Modality 

If tracking truly requires spatial processing, then it might be expected 
to interfere to a greater extent with a second task that also requires spatial 
processing, than with one that does not. A related issue concerns the rela- 
tion of spatiality to modality. Normally the visual modality is associated 
more directly with spatial processing, and most demonstrations of spatial 
interface occur with visual tasks (but see Baddeley and Liebennan, 1980) . 
An experiment by Isreal (1980) therefore asked whether the spatial aspects of 
tracking could be unconfounded from the modality aspects, to establish if a 
common spatial processor transcends across modalities. 

In Isreal* s experiments, nine subjects performed a one-dimensional com- 
pensatory tracking task that was displayed either visually or auditorily. In 
the latter condition, "error" was displayed by a tone that varied redundantly 
in pitch (proportional to absolute error) and apparent spatial location (pro- 
portional to signed error). The subject therefore tracked in such a way as 
to maintain the tone in the mid plane of the head, and at the lowest possible 
pitch. Because of the lesser degree of familiarity with this display, subjects 
were provided twice as much practice as with the visual task, and the auditory 
bandwidth was adjusted to a lower level so that RMS error obtained on the two 
displays was equal. 
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Concurrently, subjects performed a disjunctive go/no-go reaction time 
task, presented in either the auditory or visual modality. This required 
either a spatial discrimination (of a tone in the left right ear, or a bar 
on the left or right side oC the display) , or an intensity discrimination (a 
low intensity or high intensity bar flash, or a soft or loud tone) . Stimulus 
discriminability was adjusted so that single task RT in all conditions was 
equivalent but for a 40 msec difference favoring the auditory modality. This 
difference accounted for the greater peripheral transmission latency of the 
visual modality. 

Figure 8 presents a measure of total dual task interference (combining 
decrements in RT latency and accuracy and tracking error from their respective 
single task values) . A number of reliable trends are evident from the data. 

(1) There is an overall "spatial effect" since the decrement with the spatial 
probes is enhanced relative to that of the intensity discrimination probes. 

(2) This spatial effect is enhanced when probes appear within the same mo- 
dality as the tracking task (the four outermost probes of the figure) . (3) 
There remains a reliable cross-modal spatial effect (the four inner points). 

(4) Intramodal interference is greater in the auditory than the visual modality! 
de^pit the equivalence of single task tracking performance. The detailed in- 
terpretations of these results are presented elsewhere. However, they have 
some important implications both with regard to tracking and to the structure 
of resources. The confirmation of the spatiality of cracking is perhaps 
not surprising, but it is important to note that this effect is unequivocal 
even when all other characteristics of the spatial and intensity probes were 
equated (including their response component ~ a non-spatial press of a sirg±e 
button) . The cross modal spatial effect confirms the existence of modality- 
free spatial processing (Badd^ley and Lieberman, 1980), but its enhancement 
within the modalities suggests a modality-specific component to spatiality 
as well. Concerning a dominance relationship between shared modalities i 
shared spatial processing as outlined earlier, the answer appears to depend 
upon the tracking molality. When visual tracking is employed, the *^tia] 
effect clearly outweighs the modality effect. Yet with auditory tj .eking, the 
relation appears to be reversed. 

These latter results suggest an asymmetry either between tracking tasks, 
or modalities. It is possible that the auditory task — despite its initially 
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greater practice, easier level and equivalent performance — has a greater 
resource limited region than the visual task and is therefore susceptible to 
greater interfering effects. This is due perhaps to a less natural association 
between that modality and spatial processing. Alternatively, the auditory mo- 
dality itself may have less time-sharing capacity, independent of the task 
characteristics. This interpretation is consistent with a similar tre-J to 
that observed in Figure 7, when Treisman and Davies' (1973> data from an 
analogous experiment with verbal material are replotted in an equivalent man- 
net. A related finding, also consistent with Treisman and Davies 1 results and 
those of Hunt and Lansman is that in cross-modal conditions, the auditory task 
always suffered the greater decrement, independent of whether this was the 
tracking or the reaction time task. 



Hand Assignment 

The previous study went further to suggest the spatiality of tracking, but 
did nothing to associate that spatiality with" the possible residence rf spatial 
processors witlJ.n the right cerebral hemisphere, as suggested by a wealth of 
experimental and clinical data. Indeed, functionally it may make little dif- 
ference where spatial processing "resides" as long as its resource demands are 
somewhat disjoint from verbal processing. Nevertheless, the data from the 
fourth experiment (Wickens and Sandry, 1980) suggest some hemispheric lateral- 
ization of the tracking task, and go further to indicate a potential advantage 
in time sharing realized when certain codes of processing, or resources, are 
associated with each other in a compatible relation. 

In this experiment, we proposed a condition of "task- hemispheric integrity" 
to exist when the processing and response functions of a single task are carried 
out within a single cerebral hemisphere (e.g., a spatial processing task is 
responded to with the left hand, and a verbal one with the right hand, or with 
a vocal output). The hand assignment that generates integrity may be unlikely 
to facilitate single task performance (and indeed may hinder it) , but under dual 
task conditions when a verbal and spatial task are shared, this assignment is 
predicted to maximize efficiency, because the processors within each hemisphere 
are r dy involved with a single activity. 

Eight subjects tracked with either the left or right hand alone, and con- 
currently with a Sternberg Memory Search Task. The latter was also responded 




50 



with each hand in turn. Single and dual task performance on each was assessed 
as a function of hand assignment. The basic comparison contrasting dual task 
efficiency in the integrity condition (tracking left, Sternberg right), with 
the non-integrity condition, revealed the former, with practice, to be relia- 
bly superior. A second experiment replicated this design with a M spat 4 al" 
alphabet of 26 random dot patterns. Since Loth tasks here were presunably 
spatial, little difference due to hand assignment was expected, and none was 
in fact observed. Tentatively, the results of this study provide the spatiality 
presumed to underlie processing in the tracking task with an anatomical resi- 
dence, by virtue of the assumed ipsilateral control of respond hand. 

Conclusion 

The experimental data reported here only begin to address the complexity 
of tracking demands and of the resources underlying humap information proces- 
sing. In particular, these have not addressed the potential role of strategies 
alluded to above either as these alter performance of a task (e.g., employing 
a response strategy with second order tracking to reduce interference with a 
cognitive task, or using verbal-propositional representation of complex system 
dynamics to avoid interference with a, spatial task) , or strategies are used to 
adjust the resource allocation policy. The research of Gopher and his col- 
leagues suggests that important benefits to dual task performance can accrue 
with training on this adjustment. Also not addressed has been our work on the 
implications for task workload assessment. 

I believe that the data presented here, along with the converging evidence 
from nunerous other laboratories, support the assertion that a multi-dimen- 
sional model of resources can account for a large proportion of variance in 
laboratory data from time-sharing experiments, as well as an understanding of 
the way in which manual control tasks are performed. The important issue that 
remains, however, is the extent to which these data have a bearing upon the 
pilot's performance in the air. Does he truly time-share? Or, in fact, is 
processing typically "single channel" in the real world (Moray, 1980). Alter- 
natively, does the proportion of variance accounted for by structural differ- 
ences reduce to a fairly trivial level, in relation to variables such as time 
pressure, or task complexity (e.g., the demands within a pool)? As so often 
is stated, answers to these questions awa< t empirical validations. 
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A COMPUTATIONAL THEORY OF THOUGHT 
APPLIED TO TRADITIONAL PROBLEMS 



EARL HUNT 
THE UNIVERSITY OF WASHINGTON 



Thinking is the act of manipulating an internal representation of 
th« external world < Johnson-Laird, 1980 } Nenell* 1980) . Tb e goal of 
cognitive science is to discover the for* of this representation* 
Thinking is also an act that Must be done by something. Thus a 
cognitive science theory of thinking Must Meet two requirements* it 
Must be an accurate reflection of the environMent as seen by the 
thinker * and it Must be a syMbol systeM that can be realized by the 
thinker's brain* a particular physical organ* Both requirewents 
restrict our thinking* but opinions differ Markedly concerning the 
relative iMPortance of each type of restriction* At one extreMe* the 
computer siMulators argue that if we can find just one syMbol systeM 
that can solve the probleMS that people do, then that systeM is 
iMMPdiately a candidate psychological theory (Newell* 1973). This 
iMplies that the environMent is extreMely restrictive* and that what 
we should do is exawine invar iances in our response to it* An 
alternative view is that the environment can be represented in Many 

53 

ERJC 61 



Lake Wilderness Attention Conference 



ways* If this is true, it will be More efficient to determine uhat 
basic functions the brain uses in constructing a representation* 
Host of experimental psychology follows this tradition* Finally* 
Physiologically oriented psychologists argue that the physical nature 
of the brain restricts the possible representations it can construct* 
If this is true, cognition can be understood only after we understand 
the physiology of the brain* 



Our beliefs about the importance of each of these restrictions 
determines what sort of thinking we choose to study* Cognitive 
science studies, growing out of artificial intelligence and 
linguistics traditions focus on invar iances, while experimental 
psycholoqy focuses on the relation between changes in the environment 
and changes in performance* While, in principle, the physiological 
approaf?h could be applied to the study of invar iance or change, in 
practice change is emphasized* Changes in thought that are 
systematically associated with changes in the thinker have been of 
particular interest, for thinking cannot be described fully unless we 
know who«>e thinking is being described* Individual differences in 
cognition are far from randcmj they are systematically related to 
non-psychological attributes such as age, sex, and education in the 
broadest sense* Furthermore, intra- individual changes in thinking 
ran he produced by temporary changes in physical status, e*g* by 
fatigue, time of day, or drug state* 



What has been learned from these different approaches? The 
Cognitive Sciences have produced a number of case studies 
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(Simulations) and sone loosely stated general principles that are 
said to be dictated by these studies* Exper ifiental psychologists 
have produced detailed Models of how people respond to changes in 
specialized laboratory environments* but have not combined these 
models into a unified theory of Mental action* Studies of individual 
differences have produced "theories of intelligence" that are More 
scheMes for classifying Mental behaviors in terMs of their correlates 
than they are Models of the thought process itself* The theoretical 
contribution of physiological studies of cognition is even less 
clear* With the possible exception of Luria's (1966) atteMpts to 
develop a Model froM neuropsychological evidence* the literature 
provides us with data that a theory Must incorporate* but no clear 
picture of how the incorporation is to be achieved* 



PROGRAMS* LANGUAGES AND MACHINES AS THEORIES OF COGNITION 



Is there a basic assuMption about cognition that May unite 
diverse approaches to Mental phenoMena? The Cognitive Science 
approach does contain a coherent assuMption about the nature of a 
cognitive theory* The assuMption* which will be called the 
'coMputational Model' approach* is that coqnitive behavior is not 
explained unless one can Model the behavior by a sequence of 
coMPutations* The assuMption is not equivalent to coMputer 
simulation* although the two are related* pylyshyn (1980) has pointed 
out that computational Models exist at several levels* At the lowest 
IpvpI there is the prograM intended to smulate behavior in a 
particular situation* At progressively higher levels we have the 
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lanQuaqe in which the prograM is written and the Machine on which it 
is executed* In Most cognitive science studies considerable attention 
is paid to the language, ^s it is supposed to define a virtual 
Machine that eMbodies general principles about cognition* The 
physical Machire is treated as a convenient coMPutational aid, 
without theoretical Meaning* This is certainly true when we deal with 
conventional coMpute^s* Machine considerations cannot be so lightly 
dismissed if we wish to deal with brain-behavior relations* 

Figure 1 represents the relationship between prograMs, 
Janquaqest and Machines and, More iMportantly, between each of these 
levels and the different types of influences upon cognition* 
Obviously* a prograM is written in a language, and a language is 
executed by a Machine* We represent inforMation in the environMent by 
input to a proqraw* and we represent cognitive behavior by the 
prograM's output* Only the prograM is tied to observables at both the 
input and the output stage* Physical variables can be represnted by 
Machine specifications, bvt their influence will be filtered through 
the prograM, just as output of the prograM will be Modulated by 
Machine specifications* The prograMMing language is not tied 
directly either to input or output, a point which has substantial 
iMp 1 ications for the logical status of prograMMing languages as 
psycholoqical theor ies • 



FIGURE 1 HERE 
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If the cognitive sciences effort is successful we will 
eventually product a library of programs* These will be written in 
different language*, and there will be sone Machines (e*g* 
sufficiently large fcomputers) capable of executing any of the 
programs in any of tVie languages* This situation is depicted in 
Figure 2* There are two 'dreams' of the cognitive science movement 
that «ust be dispelled at this point* One is that every running 
program written in some psychological language will be an acceptable 
Model of human behavior* This seems extremely unlikely* The prograns 
that are starred in Figure 2 represent prograns that are stated in a 
simulation language but are not* themselves* realistic simulations* 
The other dream is that there will be just one programming language 
that can generate realistic simulations* If this were true* the 
constructs of this language could be assumed to have psychological 
reality* The Production notation (Newell* 1973? hcDermott and Forgy* 
1978)* which will be discussed in some detail later* has been 
proposed as a candidate language* 

The assertion that a particular language is the appropriate 
notation for psychology cannot be tested by pointing to examples of 
its use* There are languages whose superficial structure* at least* 
is decidedly different from a production language* but that can be 
used to write any simulation program that has ever been written* The 
inherently recursive General Problem Solver program (Newell and 
Simon* 1963) was rewritten in FORTRAN (Quinlan and Hunt* 1968)* and I 
strongly suspect that any other simulation program could be rewritten 
in a similarly primitive language* if one wished to take the trouble* 
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Figure 2 seems to More accurately depict the goal of the present 
Cognitive Science approach* Good and bad simulations will be written 
in a variety of computing languages* 



FIGURE 2 HERE 



How did the chaos of Figure 2 arise? The problem is that 
computing languages sufficient to write simulations .< and * in 
particular * the production notation) are powerful enough to be 
equivalent to universal Turing machines* Other* non-psychological* 
languages have the same power* so there is no way to contrast 
languages as psychological theories on the basis of the behavior of 
programs written in those languages* Faced with this situation* the 
usual appeal is to the "naturalness" of a program written using the 
constructs of one language or another* Production matching is said to 
be a natural psychological process* whereas Fortran subroutine calls 
arf» not. Th s is clearly not an objective evaluation* 
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Pylyshyn (1980) proposed an interesting solution to this 
problem. He would begin with an excessively powerful language and 
progressively restrict it until it generated only realistic 
simulation programs* Furthermore* the restrictions should be 
justifiable in terms of our knowledge of psychological limits on 
human thought* Such restrictions are said to be 'principled' ♦ The 
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idea May be illustrated by considering the production system notation 
itself* A production is s rule of the for* 



where P is a pattern that May be recognized, and A is an action to be 
taken* The usual interpretation is that productions are resident in 
long tern memory* and that a production is activated when its pattern 
is exemplified in working Memory (Newell and Simon* 1972)* A 
principled restriction would be to require that P contain at Most 
nine s 4 < M bols* in conformity with the now classic observation that 
this is an upper limit on short terr. Memory (Miller* 1956)* 

Pylyshyn's suggestions are difficult to follow so long as one is 
confined to a linguistic approach* The sorts of principled 
restrictions that seen to be needed refer More to the characteristics 
of the Machine on which a prograM is being executed than to the 
characteristics of the language in which the prograM is written* The 
restriction on the size of productions is a good exaMple* It sounds 
lik* the restrictions placed on* say* the level of nesting of 
parentheses* that Might arise froM a particular implementation of a 
qeneral language* A.-setiond problem arises when we study inter and 
3 ntra-inrii vidi >al differences • Under what circumstances do we change 
the program* the language* or the principled restrictions? Sometimes 
this issue is easy to handle* when we observe different strategies 
that people use* The theoretical issue is more difficult to resolve 
when pfw^ical alterations in the brain produce changes in behavior 
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that would b* determined at the program level in a simulation* Do we 
want to say that an executive program in the brain of a deep dyslexic 
patient (Morton and Patterson, 1979) suddenly switches programs for 
speech comprehension following a cerebral insult? 

Figure 3 summarizes this argument* by presenting a picture of a 
more real istic "target state 11 for the cognitive science effort • The 
figure shows a restricted machine that is designed to execute some 
programs in a powerful language ♦ Any program in the language * that 
can be executed by the machine* would be a simulation* Other 
programs written in the same language* for other machines* would not 
be simulations* although they might be of interest as examples of 
Artificial Intelligence. The language alone would be too powerful to 
be a psychological theory* If the machine's design is adequately 
motivated by principled restrictions* the implementation of the 
language on the machine would bp a psychological theory* a 
computational theory of thought* 



FIGURE 3 HERE 



In the following section a computational theory will be 
presented, and then used to describe some of the major phenomena 
studied by experimental* physiological* and psychometric 
psychologists. 
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A PRODUCTION EXECUTING MACHINE 

The conputational theory to be presented consists of a language, 
the well known production notation * and a Machine for inplenenting 
productions, subject to principled restr ictions* Various bits and 
» pieces of the Machine have been proposed in previous papers (Hunt* 
1966J X97X\ 1978} 1981* Hunt end Poltrock, 1974K Sirtilar proposal 
have been wade, in different terns* by Anderson (1976* Anderson arid 
Bower* 1973J Anderson* Kline and Beasley* 1978) and the relation to 
the work of Newell and Siwon (1972) will t ? clear* For brevity* I 
shall refer to a computational theory Machine* CTM* Although this 
phrase is sonewhat cluwsy in English* it does capture the idea that 
the machine is only part of a !*ore general theory of thought ♦ 

The CTH is based on the idea that thinking can be Modeled by 
proqrans written in the production notation* and thit this nota\ ion 
has psychological significance (Newell 1973* Hunt and Poltrock* 
19/4 >♦ Productions are thought of as rules resident in long ter« 
Memory (LTM>* Productions are executed by comparing their pattern 
parts to the contents of working nenory and* if a natch is found* 
takinq the appropriate action* Since one of the actions that is 
always taken is to change the state of working nenory* a "thought's 
in a roMPutation* 1 theory* is a succession of states of working 
«e«or y ♦ 

The production notation can be used to express complex thought 
sequences by tying productions together into "production systems", 
sets of productions required to solve a particular type of problem, 
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such as a chebs puzzle* As a sinple exaMple* consider the following 
systen for Manipulat ing algebraic stateMents* The system contains 
the mles 

Rl* -(-X) +X 

R2. XY + XZ -> X(Y + Z)* 



In these rules X. Y. and Z refer to any well forned algebraic 
expression* Now suppose that Working MeMory contained the expression 

El* AB-(-AC)* 

by application of rule Rl -(-AC) would become +AC* The pattern for R2 
wo"id be satisfied* producing the expression A(B+C)* 



FIGURE 4 HERE 



Figure 4 depicts the basic steps that Must be ta*en by any 
production executing Machine* The first step is pattern recognition* 
Each production's pattern part Must be Matched against the current 
situation* This is not .tecessarily an either-or decision* there can 
be degrees of Matching, If More than one production has a pattern 
that appears to be Matched* the Machine Must decide which production 
is to be executed* Th* is called the 'conflict resolution step' 
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(McDermott and Forgy, 1978)* Conflict resolution Must be carried out 
without considering the 'context of the situation 7 • i*e* the 
implications of choosing one production over another given the 
current representetion of the environment • The reason for this is 
straightf orward* If tha machine considers the current context, we 
have to explain the psychology of a homonculoid executive program* 
Finally, the machine must be able to execute the commands for action 
contained in its productions. The action part of a production must 
be stated in a vocabulary that refers to actions the machine can 
take • 

Figure 5 depicts the architecture of a machine that consists of 
the following parts* 



FIGURE 5 HERE 



a* A sensory system connecting the machine to its environment 

b* A long term memory system that contains the information that 
the syster. has ahout its past* This information is stored in the 
production notation* Each "elementar y" piece of knowledge that the 
machine has is expressed as a production* The organization of 
product ions into systems will be considered in more detail in a 
moment* 
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c* A working memory system that contains *nf^ nation about the 
current situation* Just what working memory should contain is a 
Matter of considerable debate* The commonest view ir» that it contains 
sections that, loosely, correspond to th* sensory systems* I would 
like to take the somewehat vaguer position that working memory 
contains an interpretation of the external world that depends up on 
perceptual responses that have been initiated by sensory input, but 
are themselves represented as productions resident in LTM* This 
argument is developed more fully in Hunt (197B)* 

d* An operating system containing a & ; of mental operators 
defining the actions that the system can take* These operators can be 
thought of as defining the vocabulary in which the action (A) parts 
of productions can be written* The set of mental operators would 
have to include operators for altering new production in LTM* 
Anderson et al* (1978) provide some suggestions* Commands to 
external effector systems would be part of the mental operator set* 
Neither learning nor motor performance will be discussed in any 
length here. 

e. A decision swstem that carries out the conflict resolution 
step • 

Figure 5 illustrates the flow of information and control between 
these systems. Information from the sensory system and from Working 
Memory address LIM directly. "Broadcast 11 than "Address 11 would be more 
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appropriate because the input is not directed to any particular 
location* Selection of information takes place within LTM* as a 
production will simply fail to recognize any input that is random 
with respect to its pattern* 



Working Memory receives its input directly from the operation 
system* That is* every input into working Memory (and every change 
of the contents of working Memory) is assumed to result froM the 
activation of one of a limited nuMber of basic Mental operations* 
These operations can be looked upon as the coMputing circjits of the 
CTM* The action part of a production Must be a command to execute 
one or more basic operations using specified data in LTM and working 
memory as input* Newell (1980) has provided a list of some of the 
operations that any symbol manipulating device would require* e*g* 
transferring information between working and loni term memory* 
Simple transformations of stored records or simple operations 
irvolving the merging of dat3 may also be executed by the basic 
operations. For example* we ombine color # location* and shape 
information in order to see unitary visual percepts* Tf.JS in a 
computational theory the ' laws of perception' provide one clue about 
the nature of the basic mental operations* 



The operations * in turn* can receive two sorts of inputs from 
long Term memory* a control signal indicating that a particular 
operation is to be activated* and a data signal indicating the input 
on which it is to act* Thus the input from long term to working 
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Menory is not analagous to siMply reading a record, although this 
presuMably does occur* Basic transforations of stored records* or 
sinple operations on a few records* May also serve as connands to 
this systen* 



Conflict resolution takes place in the control systeM* This 
systeM receives at Most one input froM each production in LTH* a 
siqnal indicating the extent to which that production has been able 
to watch its pattern part to the signal currently being broadcast 
throughout LTM* The strength of the signal froM a production will be 
a joint function of the correlation between broadcast signal and the 
pattern part of a production and of the current 'activation 
threshold' of that production* A forMalisM is useful to express this 
point* Let r<X*P) be the correlation between the signal currently 
being broadcast* X* and the pattern part* P * of sone production* 
Each production is assigned a threshold* T<p)* that May be changed 
froM tiMe to tiMe* The response of a production to signal X is 

(3) 0 if r<X*P> * T<P) 

Output = 

q(r <X*P)-T(P> ) otherwise* 



where q(x) is a non-deer easing function of its argunent* All the 
derision MecharnsM "knows" is the value of the outputs of the various 
productions* Conflict resolution reduces to deterMining which 
production is eMitting the strongest output* and allowing that 
production access to the operations systeM* 

ERIC 7a 



Lake Wilderness Attention Conference 



Stated this way, the conflict resolution probleM is trivial* It 
becomes non-trivial when we consider something that is certainly true 
of all biological systems* All signals are Masked by noise* To 
represent this, noise vectors are added to signals in two places, 
into the signal broadcast into LTM from the Working Memory and 
Sensory systems, and into the signal from LTM into the decision 
system* Obviously this will make pattern recognition wore difficult* 
The size of the correspondence between LTM patterns and working 
memory will be underestimated, and random fluctuations in watching 
working nenory to irrelevant patterns will be exaggerated* in order 
to Make sure that the appropriate production is selected for action, 
the decision systeM Must allow for the effects of randon variations 
in the signals that it receives* There are several possible decision 
policies it can use* All follow the sane general procedure? select a 
signal if it is either MOMentarily the strongest signal by soMe 
Margin, h, or if it has Maintained itself in position as the 
strongest signal for sone tiMe period, t* The par aMeters M and t 
should be set to reflect the relative values of signal and noise in 
the output of LTM* As a general rule, the greater the noise, the f 
longer it will take to achieve reliable conflict resolution when the 
signals froM LTM are constant* If the signals are variable over 
tiMe, as they would be if the environnent were transient, the amount 
of noise in the systeM would deternine the Max i mum speed with which 
the systeM could react to environmental change* 
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production, must be given* Each production in LTM is assumed to be 
tied to every other production in an associative network (Anderson 
and Bower, 1973, and Many others ♦> Formally, consider any two 
productions, s and r, which will be referred to as the sending and 
receiving productions. There will be some real valued link, a(s,r> 
that reflects the directional association between then* Whenever the 
sending production produces some non-zero output, o<s>, the receiving 
production will have its threshold lowered by an amount that depends 
upon the product, a(s,r> x o(s>* 

Computationally, what the associative network does is to 
establish production systems, since the activation o^ a sending 
production can sensitize closely associated receiving product ions ♦ 
perhaps to the point that almost any signal will be sufficient to 
activate them* This is shown in Figure 6* Linkage of productions 
through association proceeds independently of linkage that may be 
established through working memory, as the "spread of activation 11 of 
a signal from a sending to a receiving production is not dependent 
upon the sending production's being selected during the conflict 
resolution stage* This is also shown in Figure 6. This provides two 
different ways in which one production may activate another, 
automatic linkage through the association network, and controlled 
linkage through working memory* The terms 'automatic 7 and 
'controlled' are justified in the sense that controlled activation is 
directed b-* the decision system* An analogy to human performance will 
be considered in the next section* 

to 




68 



Lake Wilderness Attention Conference 



FIGURE 6 HERE 



As a way of summarizing these ideas, the steps in activation of 
a production system will be traced* The system begins in the state 
shown in Figure 7, with production PI linked to P2 P2 to P3 and P4* 
P3 linked tightly to P5 and loosely to P6, and P4 tightly to P6 and 
loosely to P5* Further links are also shown* Production PI Matches 
stimulus pattern SI in working memory* and the associated action Al 
is taken* producing S2 in working memory* At the sane tine production 
P2's threshold is lowered to such an extent that action A2 is taken * 
regardless of the result of the first action, producing S3* 
Activation through the association network does not discriminate 
between P3 and P4* hence the determination of the appropriate 
production will depend upon the extent to which these two productions 
can match S3* As there is no bias toward the one production or the 
other * the decision must be made at the conflict resolution stage, 
and thus will be affected by the efficiency of the decision system* 
The choice that is made at this point will bias the next decision* 
between P5 and P6* but will not determine it absolutely* as both 
productions will have been activated* but in unequal amounts* Thus 
the last step will be guided both by the "automate';' mechanism acting 
within LTM and by the "controlled 11 mechanism involving working 
memory * 
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FIGURE 7 HERE 



APPLICATIONS 



A computational theory of thought should provide a plausible 
framework for tying together disparate phenomena. In this use of the 
theory a broad brush picture must be painted* One shows that the 
general form of the phenomena of interest can be produced by the 
theory* but one does not go to the detail of actually matching theory 
to data* Such an evaluation will be called a macroscopic evaluation* 
The theory can also serve as a set of principles to be followed in 
designing models for specific experimental situations* The models 
can be tested against observed data* This is a microscopic 
evaluation* Both steps are 1 mpor t ant • The advantages of mi cr oscop i c 
evaluation have been stressed over and over* particularly within 
experimental psychology* I believe that microscopic evaluation should 
be preceded by macroscopic evaluation* Before going to the rathe** 
considerable work o* gener a*,ing and testing models for specific 
situations* we want to be sure that the models themselves are being 
generated by a theory that is* itself* acceptable on a priori 
grounds* The remainder of this paper is an attempt to establish the 
macroscopic acceptability of the CTM just described* 
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The psychological phenomena to be discussed fall into three 
broad categories? observations froM experinental psychology, the 
study of individual differences, and biological psychology* Each of 
these categories will be represented by two problems that have 
excited considerable study within their own field* In discussing 
each problew the basic phenoMena will first be described, then an 
explanation will be offered in terns of the Model, and soMe consents 
will be wade about how a conputat ional theory addresses soMe of the 
issues raised by the data* 



PHENOMENA FROM EXPERIMENTAL PSYCHOLOGY 



AUTOMATED AND CONTROLLED RESPONDING. Tasks vary greatly in the 
anount of attention that they require. "Attention" here has an 
explicit weaning. Responding requires attention if the response's 
characteristics are influenced by the presence of distractors, or if 
Making the response interferes with a concurrent task. Schneider and 
Shiffrin (1977) have reported a particularly draMatic exawple of 
antoMated responding based on the scanning of visual display. Their 
paradiqM is illustrated in Figure 8. An observer was first shown one 
or More target characters, followed by a very rapidly presented 
sequence of displays, each containing either a target character and 
N~ l distractors, or N distractors without a target character. The 
ta.-k was to report whether or not the target was present. When the 
:>«r*H nidraeters were targets on soMe trials and distractors on 
others, then the speed and accuracy of detection varied as a function 
of both the m»Mber of targets and nuMber of distractors on that 
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trial* The relationship is Maintained over thousands of trials* 
However * when characters were divided into characters used only as 
targets and characters used only as distractors* th^ relationship 
between performance and number of targets end distractors almost (but 
not quite) disappeared after extensive practice* The detection 
response was said to be automated* 



FIGURE 8 HERE 



Automated responding can also be established by extr laboratory 
learning* Most of the experiments showing this involve the highly 
overlearned responding to visual figures that are required in 
reading* If you are a skilled native speaker of English you cannot - 
look at the visual figure CAT without reading the word "cat*" 
Indeed* it is difficult to inhibit such overlearned responses even* 
when it would be to your advantage to do so* This is the basis of the 
Stroop phenomenon < Stroop * 1935 ) * 



9 
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The distinction between automated responding and its opposite* 
controlled responding* has been heavily featured in recent literature 
on attention and performance* Controlled responding is said to be 
characteristic of poorly learned tasks* and of tasks that involve a 
great deal of response incompatibility* Automated responding is said 
to be charcter istic of highly overlearned skilled performance* even 
where that performance is itself quite complex* I have even heard an 
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argument that automated responding is at the heart of skilled 
athletic perf or nance • 

In the Model proposed here controlled responding is defined as 
responding that involves the decision mechanism and* thus* is guided 
by information in working Memory. Automated responding is defined by 
the absence of such involvement* A response is automated if the 
appropriate production is activated with a minimum of processing 
during the conflict resolution phase* The theory requires that we 
distinguish two levels of automatici ty • The tightest possible 
connection between a stimulus and response occurs when the stimulus 
is connected to the response by a single production* An intermediate 
level of automation can be reached if a secence of productions is so 
tightly linkeo into an association network that activating the first 
production in the chain will almost always followed by the activation 
of the subsequent productions. 

What sorts of stimulus-response connections can be automated? 
Shiffrin and Schneider (1977) appear to take the position that 
virtually any stimulus can become the signal for an automated 
response* providing that during training the mapping from stimulus to 
response is always consistent ... i . e the same response is required to 
a fixed stimulus. Others have been more cautious. Freisman and 
Gelade (1980) have presented evidence indicating that it is 
difficult, if not impossible* to automate detection responses unless 
the %timi»li are perceptually primitive ones that can be defined by a 
peripheral sensory process. For instance* it is possible to develop 
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an automated detection response to 
distractors* or to an F in a field 
automate the detection of a red F 



a red stimulus in a field of blue 

V 

of X distr actors * but one cannot 

in a field of blue F's and red X's* • 



Results such as those just cited suggest strongly that automated 
responding is limited to figures that satisfy the perceptual 
requirements for 'good Gestalt*' Presumably such stimuli are 
identified at a relatively peripheral level in the sensory system* 
If such a limitation does apply to automated responding* some of the 
more ambitious uses of the concept* such as the anecdotal explanation 
for performance in tennis and basketball* would not be warranted* 
Some further data that my colleagues and I have obtained in our own 
laboratory complicates the picture* In one experiment (Yantis* Hunt* 
and bright* Note 1) we found that automated detection responses thpl 

4 

had been established tr upper case letters transferred almost 
perfectly to the detection of graphically dissimilar lower case 
letters* such as r and g* Thus we demonstrated generalization of 
automation along a semantic dimension rather than a physical one* 
Another experiment (Bhimamura and Hunt* Note 2) demonstrated the 
importance of semantic response dimensions ♦ In this study native 
readers of Japanese participated in a Stroop type experiment using 
both Kana (phonetic) and Kanji (ideographic) text* Greater 
interference was found using the Kanji script* which was related to 
t»,e interfering ( color ) response semantical ly * -even though the naming 
of coJors* thfc» overtly interfering response* proceeded more rapidly 
in Kana* Taken together* these experiments demonstrated automated 
responding along semantic dimensions of both stimulus and response* 
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Such bindings a^e not consistent with the position that automated 
responding is restricted to perceptually primitive units* 



The paradox nay disappear if we accept the idea that there are 
two levels of automated ret.ondingj one depending on the 
establishment, of a single production tying stimulus to response* and 
one depending on the establishment of associative lj.nks between 
productions ♦ The first case woul'1 l_?ad to a rapid* brief response 
characterized by a low threshold* and appropriately tuned patterns at 
the pattern recognition stage* and the emission of a strong single 
that would rapidly capture (and release) the decision system* This 
type of automation* however* would be limited to stimuli and 
respor^es that could be described within the limits of a single 
production* Within a computational theory the uestior. "«jh:c sort of 
stimuli can automated completely 11 is interpreted as a question 
r»bout the* principled restrictions that should be set upon the 
complexity of a production's pattern part* On the other hand* there 
should be no limit to the complexity of an "automated" response based 
upon c* tiq ly linked chain of productions* It should be noted* 
thouqh, that even a tightly linked chain would never reach quite the 
deqree of automation as would a single production* Soth working 
Memory and the decision swstc?i* would be required* albeit briefly* to 
quide the execution of productions in the "am/DMsted" chain* and thus 
there wo»«ld be some interference with concurrent mental activrty* 

Mr'NTfU R'SUUKCt-S* Why is it difficult to do two things at once' 
In a trivial case* two activities may be incompatible because they 

O 
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cannot be executed physically at the sane tiMe« Try juggling while 
playing the piano. Mental tasks May be siMilarly inroMpatiDle because 
two concurrent activities cannot use toe sane structu Competition 
for space in working Memory is typically offered as the explanation 
for iur inability to do such things as rehearsing poetry and the 
muI tip 1 i cat ion tabie at the sane tiMe ♦ In sone situations, though, 
it is difficult to explain interference as being due to coMpetition 
for soMe Mental or physical structure. It is difficulty although not 
quite iMpossible, to talk while juggling. To offer a wore 
p sschoiogical exaMple, perfornance on the Raven Progressive Matrix 
test, a widely used test of general intelligence, can be depressed by 
having a person balance a light lever with one hand while taking the 
test (Hunt, 1980). Hickens (1979) has listed a nUMber of ether 
exa*p les . 



KfihneMan (1973) has Maintained that we cannot account for 
non-structural interference without assuMing the existence of a 
"Mental resource", soMewhat akin to an econowic resource, that is 
shared bs concurrent Menta] activities. Two tasks will interfere 
with each other if their total resource deMands exceed the available 
smppIm of the resource. This idea has been elaborated upon in several 
subsequent papers (NorMar and Bobrow, 1975} Navon and Gopher, 1979? 
Po^ner, 1978 J Hunt and LansMan, in press.) While the need for the 
concept of a Mental resource is widely acknowledged, the n*\ure of 
that resource has never been specified* Thus it is difficult to 
connect Models of » erforMance that use the concept of Mental 
resources to Models of perforMance that deal with inforMation 
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processing by different structures* This is the sort of Macroscopic 
issue that should be addressed by a general theory* 



In conputat ional theory "concurrent activity" occurs when the 
executions of productions fron two production systems* A and B f are 
interleaved* In this case the tasks associated with the two systems 
will be said to be "tine shared*" If we write Pa<i) for the ith 
production of »ysteM A and Pb'j) for the Jth production of systen B* 
the sequence of production execution for tine shared tasks night be 
night beJ 



rroM an observer's point of view actions relevant to each task 
would be executed simultaneously so long as production execution 
within each task was rapid enough not to produce recordable delays* 
Interference co<ild arise in several ways* The presence of stiMuli 
froM task H in working Memory Might influence production selection in 
task A* bs altering the pattern recognition phase* Mental operators 
required for on task Might be preeMpted by another* or the proces of 
ronflict resolution Migh* be slowed by the simultaneous activity in 
production s^steMs A and B* 

Two production systeMS will be defined as being structurally 
independent if (a) the pattern parts of ill productions within one 
systeM are randOM with respect to the pattern parts contained in the 



Pal* Pa2. Pbl* Pa3* Pb2*Pb3 



* * * 
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other systeM and (b) if the action par + s of the two systens do not 
use the sane (Mental) operators. If two production systeMS a^e 
str uctur al ly 1 ndependent interference due to cor f 1 icts in the pattern 
recognition and production execution phase cannot occur ♦ The two 
tasks way still interfere if the conflict resolution phase becones a 
bottleneck ♦ This will be called resource coMpetition, and "Mental 
resources" will fce defined as having access to the decision systeM. 
Resource coMpetition can occur if the decision systeM receives two 
approKiMately equally strong signals , fron two independent systeMS, 
both requesting access to the working MeMory-Mental operation systeM, 
even thonjh the two productions "want" quite access to different 
Mental structures. This is shown scheMat xcal ly in Figure 9. 



FIGURE V HERE 



Resource coMpetition can be understood by a siMple analogy. 
iMagine an airline reservation clerk who is faced with two custoMers, 
e^eh of whoM wants to go to a different place, at a different tiMe# 
There ls no conpetition between passengers for seats on an aircraft, 
i.e. no structural roMpetition. The custoMers do coMpete for the 
attention of the airline clerk. In particular, suppose that queue 
tliscipl ine is non-existent, so that the custoMer who shouts the 
loudest receives iMMediate attention. The More custoMers trying to 
be served at any one tiMe, and the More vocal they are, the harder it 
will be for ans one custoMet to get service. FurtherMore, and this 
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is an extreMely important point, the degree of interruption between 
custoners (and between tasks * in the Model) will be deter wined by the 
disparity between the custoMers' voices* If one customer is clearly 
shouting wore loudly than anoth r* then the clerk will immediately 
service the noisy custoner* In terMs of the theory* the Most 
activated production will be allowed to execute* Resource 
coMpetition occurs when all the custowers shout at an equal level of 
loudness* thus forcing the clerk to ask the* *o shout again* so that 
the loudest voice can be located* 

This explanation of interference between concurrently executed 
tasks is clearly related to the explanation offered for the 
distinction between controlled and autoMated tasks* It follows that 
the More hiqhly autoMated tasks A and R are* the less usp they will 
wake of the decision systeM* and thus the easier it will be to do 
the* concurrently* On the other hand* the theory states that even 
the MO?>t autoMated tasks Make sone use of the decision sy«teM* Thus 
if one kept adding More and More automated tasks into a concurrent 
activity* there would be a point at which interference would be 
produced* It Might be lwpossible to do ta^ks A*L* and C without 
cost* even though any pair of theM could be done concurrently without 
their interfering with each other* 

Two individual differences issues that are the topic of current 
study by e;:per mental psychologists can be addressed using the 
concept of attention as a resource* Are soMe people better than 
others at doing concurrent activities? In coMputer science jargon* is 
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there 3 dimension of individual differences that could be called 
"time sharing ability"'* This apparently simple question turns out to 
have a complex answer ♦ According to the theory a person who was an 
efficient tine sharer would be a person who executed conflict 
resolutions quickly, so that the execution of productions from 
different systems could be inter leaved ♦ It is not at al 1 clear ♦ 
though, .hat this would show up as a "time sharing ability 11 in a 
conventional correlational study of performance* Two activities 
would Most easily be time shared if they were h. My automated* In 
such a case tine sharing capability would be relateu to the ability 
to do the tasks singly, since degree of automation would predict 
Performance both in the single activity and concurrent activity 
situations ♦ Suppose that the activities were both controlled tasks* 
In «*iirh a case the tasks, when executed singly, would reflect the 
efficiency of the decision system, because it is so involved with the 
execution of controlled tasks* These observations pose a problem for 
the identification of individual differences in time sharing, because 
one would always expect performance in single task conditions to be 
related to dual task performance* This is true even though the 
postdated mechanism for the control of concurrent activities is a 
relatively simple one* 



F'syrholoqi sts and educators have lonq been interested in the 
difference between expert and novice performance* Larkin et al* 
(1980) propose that the distinction be modeled by differences in the 
production systems required to model each type of behavior* It 
appears that experts can be better modeled by systems that are more 
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tigntly linked, so that the detection of 3 single key characteristic 
of a probleM l^ads to the iMMediate execution of a stage of relevant 
actions, as a unit* To illustrate, once an expert chess player has 
derided upon an nooning the appropriate sequence of Moves follows 
quickly, up to the next choice point* A novice Must discover each 

MOV* • 

Tf this view of expert behavior is cuMbined with with the 
theoretical explanation of resource coMPetition just offered a 
prediction follows that, insofar as I know, has not been tested* 
Figure 10 depicts the production systeMs of an expert and a novice, 
doing soMe abstract task* Double lines have been used to indicate 
tight linkage* When the expert detects a signal sufficient to 
activate production PI productions P2 and P3 will follow 
automatically • During this period the expert's decision system would 
he free to respond to extraneous signal, (Px in the diagraM) because 
it would be easier to interleave response to the signal with the 
execution of autoMated task relevant activity* The situation changes 
after the execi iti on of production P3* At this point a choice point 
occurs, and the expert's decision systeM will be fully occupied with 
the discrimination between two strongly activated signals, froM 
and PJm I he extraneous signal will not be able to break into the 
conflLrt resolution sequence unless it is stronger than either of 
these • In the novice's case, the discrimination will be between two 
weak signals, and hence it will he easier for an irrelevant signal to 
capture the decision system* 
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FIGURE 10 HERE 



Suppose that two people, an expert and a novice, are asked to do 
2 task and, simultaneously, to respond to a probe task, such as 
turning of a light if it cones on* If the probe is presented at a 
time at which performance on the Main task is routine the expert 
should respond to the probe task More rapidly than the novice, other 
things being equal ♦ If the probe is presented at a tine at which 
alternative behaviors are possible in the main task, the expert 
should be slower to respond to the probe than the novice* 

INDIVIDUAL DIFFERENCES ISSUES 



THE SOURCE OF INDIVIDUAL DIFFERENCES* To what extent are 
individual differences in cognitive performance due to physical 
differences between people, and to what extent are they due to 
differences in the knowledge that people possess? This question has 
perhaps engendered wore controversy than any other issue in 
psychology. There is a natural way to frame the question within a 
computational theory* What individual diffr ences in per for fiance 
should be Mimicked by changing the production systems available to a 
simulation, and what differences should be mimicked by chanq?ng the 
parameters of the underlying machine? 



It is certainly true that all performance depends upon the 
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possession of the appropriate productions by the performing system* 
Simply having the productions in LTH* however * is not enough* They 
must be tied together into appropriate production systems* Loosely, 
the presence of a production only ensures that the system knows how 
to make a particular reaction to <* stimulus* The linkages between 
productions determine the contexts in which this reaction can occur* 
It has been amply demonstrated that these two different forms of 
knowledge represent different dimensions of individual performance in 
humans* One of the striking findings of recent research on mental 
retardation* for instance, is that retardates often are capable of 
executing basic mental tasks* such as rehearsing a list of numbers in 
OPcfor to memorize it* but that they do not recognize the context in 
which such strategies are appropriate (Robinson & Robinson* 1976* 
pg*299)« \ 

Individual differences in the possession and organization of 
productions can be thought of as differences in knowledge possession* 
Individual difference* in the effectiveness of production execution 
at the machine level* differences in what has been called the 
"Mechanics of thought" (Hunt and Poltrock* i974> are also important* 
The mechanics vs* knowledge distinction offers a way to think about 
the classic nature-nurture distinction* Nurture could affect 
mechanics as wel 1 as knowledge* but nature (genetic int luences ) C£n 
act only throuqh alteration of the biological machi >ery that 
influences our thinkinq* 

If the computational viewpoint were to be accepted by behavior 
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geneticists it would lead to the design of studies of the 
"inheritance of intelligence 11 that are rather different froM Most of 
those conducted to date* Most behavior genetics studies use (often 
elegant) MatheMatical analyses of genetic Models to partition the 
variance in individual perforMance on quite coup lex Mental tests, 
such as the Wechsler scales or the Raven Matrix test discussed 
earlier* According to a coMputational theory of thought, it would be 
More sensible to use as one's dependent variable extreMely siMple 
tests that, insofar as possible, tapped e single component of the 
cognitive coMputat lonal process* The obvious candidates are tests of 
the pattern recognition and conflict resolution processes* t**sts of 
the efficiency of working MeMory, and tests of specific Mental 
operations* The h no ledge gathered fron such studies would be harder 
to fit into *uch global questions as "what is the heritability 
coefficient of mtel 1 igence?" , but the coMPutational viewpoint does 
not regard this as a terribly relevant question anyway* It Makes More 
sense to exaMine individual differences in specific coMponents of 
Mental action* 

tXHAlNING THE DIMENSIONS OF INTELLIGENCE* PsychoMetnc 
theorists break intelligence into a nuMber of conponents* No attend, 
will be Made to review the various proposed dimensions of 
intelligence* The interested reader is referred to Nunnally (1978) 
and Sternberg (1977) for discussions froM different points of view* 
For our purposes, a particularly interesting theoretical distinction 
is Lattell's (1971) distinction between crystallized (GO ano fluid 
<Gf ) lntelliqence* Horn has elaborated upon this distinction in 
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gr^at detail, and the renarks in this section should be understood as 
referring to Gc-Gf theory as presented by hi* (Horn and Donaldson, 
1979) ♦ 

Crystallized intelligence (Gc) is assumed to be a dimension of 
individual perf ornance associated with the ability to use highly 
overlearned, culturally acquired infornation in an appropriate 
fashion* A vocabulary test is the epitope of a task requiring Gc* 
Indeed , Most tasks that are Markers of Gc are also verbal tasks, 
which does present a probleM of interpretation* Fluid intelligence is 
a di Mens ion of per forMance associated with the ability to solve new 
problens rapidly and efficiently, perhaps by inventing a solution 
strategy on the spot* Thus fluid intelligence places More eMphasis 
on the ability to rearrange inforMation, while crystallized 
intelligence places More eMphasis on the ability to retrieve it* 
Mmd intelligence is often tested by non-verbal intelligence tasks, 
such as the Raven Matrix test Mentioned earlier* In fact, this test 
is soMetiMes considered a Marker for Gf ♦ Verbal tests of Gf can also 
be constructed ♦ Such test eMphasize verbal reasoning about presented 
inforMation rather than the retrieval of verbally coded infr nation « 

FroM a coMPutat lonal viewpoint tne Gc-Gf distinction is closely 
related to the distinction between controlled and autonatec* tasks* 
pHrforMar.ee on tasks requiring Gc should deMand (a) that a person 
have the necessary production systeMS present and (b) that the 
situation for their activation be recognized* Once the 
appropr lateness of the product ion systeM is recogni zed , its 
execution, being based on highly overlearned information, should be 
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relatively automatic and efficient* Tasks demanding Gf are tasks 
that, by definition* cannot be solved by tightly sequenced systems of 
productions* Hence a solution Must be developed one step at a time* 
Such tasks should be more influenced by the efficiency of the 
decision system and of working Memory as these systems * taken 
together* determine the selection of productions in situations in 
which tightly linked sequences of productions do not exist* 

Liver the past several years a number of studies have been 
reported that bear directly on this issue* Tasks that require the 
simple retrieval of information from long term Memory (i*e* tasks 
that test the pattern recognition system) typically show reliable but 
moderate correlations (about *3) with psychometric measures of 
crystallized intelligence* This was first shown for the stimulus 
matching task (Po^ner and Mitchell * 1967)* in which a person is shown 
two different forms of the same lexical item***e*g* the pair of 
?>Mmbols CAT and cat***and asked if they have the same name* People 
who do this rapidly tend to have high scores on Gc tests 
( Hunt * 1970b ) ♦ More recently we have obtained similar results using 
other test*- of the retrieval of over learned information* such as the 
semantic ver if xcat ion (Col 1 ins and Qui 1 1 ian* 1970 ) and lexical 
i dent if irat ion (Meser *Schvaneveldt * and Ruddy * 1974 ) tasks (Hunt * 
Davidson* and Lansman* in prass)* 

Kdther different results are obtained if one uses as the target 
variable a test of fluid intelligence* such as the Raven Matrix test* 
While there is much less information available* it appears that the 
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information processing paradigms that are most highly correlated with 
Raven Matrix performance are tests of choice behavior, as exnp lified 
by the slope variable in a Fitts' type choice reaction study, and 
tests of rapid verbal reasoning, such as the sentence verification 
task developed by Clark and Chase (1972)* The latter task is believed 
to have a heavy verbal component* Some typical measures obtained in 
the study of information processing correlates of Gc and Gf tasks are 
shown in Table 1. They suggest, but certainly do not prove, that Gf 
is more closely related to decision making than to pattern 
recognition* 



TABLE 1 HERE 



ISSUES IN BIOLOGICAL PSYCHOLOGY 

AG TNG* Botwimck (1977) has summarized the gross observations 
about aqp and intellectual change by referring to a "classic aging 
pattern" • As people move through the adult years verbal performance 
is maintained and even increased, while performance on non-verbal 
inleJ i mence tests decreases* The picture ?s not quite thai simple, 
because verbal tests that involve reasoning and inference appear to 
show ci decrease , whi le tests of simple retrieval and recognition of 
ver^ 1 meanings increases (Cohen, 1979)* Horn, in his extensive 
writings on this subject (see, in particular, Horn, 1978, 1979) has 
argued that the correct distinction is between Gc and Gf functions 
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rather than between verbal and non-verbal functions* Hasher and 
Zacks (1979)* in an article that is quite compatible with the theory 
proposed here* clam that as one ages automated functions are 
retained but controlled processing becomes More difficult* 

Why should this pattern of change occur? From the viewpoint of a 
computational theory of thinking* what functional changes would ^ad 
to these particular changes in performance, and how Might these 
functional changes cone about? A number of authors have suggested 
that what nan happen its that the aging brain simply is not a reliable 
transmitter of signalfc between its various parts* This could occur 
because the death of nerve cells would, over one's lifetime* produce 
a progressively less reliable signal, even though the locus of nerve 
cell death was uncor r el ated with the locus of information stors^« in 
the brain. Anotner possibility is that demyel inizat ion of the nerve 
cells may produce unreliable signal transmission. in terms of a 
computational model, this wou..d be reflected by an increased error 
component in the signals transmitted through the sensory, working 
memory* and decision systems. This would lead either to more 
erroneous processing if the speed time of responding were to be 
maintained* or to slower mental processing if the same level of 
accuracy were to be maintained. It appears more consistent with the 
literature to assume that the latter is the case, mental processing 
in the elderly appears to be a (not entirely successful) effort tc 
sacrifice speed of processing in order to retain accuracy (Welford. 
1977). Deterioration of performance should be less evident in 
automated (and Gc> tasks, since unreliability would affect only the 
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pattern recognition stage, whereas controlled (and Gf) tasks would be 
affected at both the pattern recognition and decision stages* 



NEUROPSYCHOLOGICAL PHENOMENA* The study of behavioral changes 
associated with brain injury provides us with a fascinating glimpse 
of the specificity of Mental functioning* Surprisingly* there has 
bet , remarkably little integration of neuropsychological observation 
into the theoizing of either experimental or psychometric psychology* 
Possibly this is true because the "average" head injury case has 
little to tell us beyond reaffirming some general principles* such as 
the principle that most language functions are located in the left 
hemisphere* He learn much more from the occasional cases in which 
small* discretely localized insults to he brain produce highly 
specific behavioral defects* Examples of such cases are the "deep 
dyslexic" cases reported by Morton and Patterson (1979)* in which 
Patients have selectively lost the ability to deal with English 
function words* or cases in which selective loss of either long term 
or short term memory function appears (Shallice* 1979)* 



The careful study of such cases is of interest in the 
construction of a computational theory of the mind because 
neuropsychology gives us some indication of what the basic mentax 
operators are* He know* for instance* that there is a specific 
operator for storing information in LTM because we know that it is 
Possible to disable this operation* on an amazingly selective basis* 
by damaqe to the hippocampus (Milner* 1970)* Systematic analysis of 
selected cases in neuropsychology may enable us to compile an 
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extensive catalog of operators that are "basic" in the sense that 
they can be selectively physically disabled* Such a catalog would be 
useful because it would provide us with a vocabulary for stating the 
action part of productions* This would be an important, and 
psychologically principled, restriction on the power of production 
languages* 

The view that neuropsychological defects are equivalent to a 
loss of specific Mental operators provides us with an inportant and 
in sone ways Igniting theoretical interpretation of the aftereffects 
of brain injury* Loss of previously denonstratable functions would 
be associated with an inability to execute productions* but not to a 
loss of the productions thenselves* Thus loss of function should be 
largely* if not entirely, a loss of the "controlled" portion of 
previous learning* s i *e* the ability to use a production to produce 
changes in working nenory* Effects associated with the spread of 
activation fron an aroused (but now unexecutable) production to other 
productions would renain* This is the way in which a computational 
theory can account for such phenomena as the observation that sone 
neuropsychology patients hay have an enotional response to an object 
that they cannot recognize* and that even when an object is 
nissnaned > the erroneous response is connected to the correct 
response by sone siwple senantic rule, such as saying "uncle" for 
"nephew" (Marshall & Newconbe, 1966)* 

U 
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CONCLUDING REMARKS 



Ii. this paper the nature of a computational theory has been 
emphasized. Such a thenry provides a framework within which models 
of specific tasks can be generated* In general* there will be more 
than one possible model for a given task* Development of the theory 

ill involve the generation of models that can apply to concrete 
tasks* and the selection amongst them of models that can describe the 
data of specific experiments* 

If the theory is to be developed further we need models of four 
key functions; the pattern recognition process* the conflict 
resolution process* the manner of representation of information in 
working memory* and of the action of permissible mental operators on 
this representation* Empirical work to test these models will 
probably follow the traditions of experimental psychology* We need 
to find situations in which we can assume that the production system 
programs to do the task are trivially straightforward* so that we can 
be sure that behavior is being controlled at the "mechanics level" * 
by the design of the machine executing the program rather than by the 
design of the program being executed* The emphasis here is on the 
use of Cognitive Science concepts to explain behavior in the 
psychology laboratory • 

As this work is going on* other cognitive science specialists 
will be working in quite another tradition* Attempts are being made 
now* and will continue to be made* to write production system 
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programs that are capable of *.imicking the broad brush of human 
behavior in a variety of very complex tasks* The work of Riesbeck 
<1975) on the understanding of text is a good example* Hopefully 
these two lines of study will coalesce* Eventually we would like to 
see complicated programs for the management of knowledge being 
executed on (virtual) machines that had been established as 
reasonable models of the mechanics of human thought* Such a hope can 
be looked upor. as a statement of the ancient goal of reductiohism in 
science* Sociology should be reduced to psychology* psychology to 
physiology* physiology to chemistry* and chemistry to phyiics* More 
realistically* the goal hi»re is to reduce "the psychology of problem 
solving' 1 to "the psychology of information processing* 11 * and to point 
toward a connection between information processing and physiological 
Psychology* In the past efforts toward this goal have not been too 
successful* The reason that the Cognitive Science effort may succeed 
is that it has a new and powerful set of concepts* The distinction 
between program* programming language* and machine design may provide 
the ideas we need in order to establish a reductionist theory of 
mental action* 
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TABLE ONE 



PSYCHOMETRIC VARIABLE 

EXPERIMENTAL TASK Reading comprehension, Raven Matrices 

Vocabulary 

Lexical identification moderate 8na11 

Semantic categoriiation 



Speed of choice 



77 moderate 

Types of Experimental Paradigms that have been 
related to performance on psychometric tests 
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BEATTY: NEUROPHYSIOLOGY OF PROCESSING RFSOURCES 



1.0 Introduction. 

The human nervous system, like any physical Information processing 
system, 1s limited 1n capacity. Capacity limitations are unimportant when 
processlnq demands are low, as all demands can be easily met. However, 
people are frequently required to process more Information at hlqher rates 
than their nervous systems can accommodate', a situation that leads either to 
a restriction in the tasks that are attempted or to a necessary 
deterioration 1n performance quality. It is not surprising that the nature 
information processing limitations has formed a central focus for 
coqnitive psychology in the past quarter century. 

A number of views of the nature of processing limits have heen proposer 1 
by cognitive psychologists, flroadbent, 1n his seminal book Perception and 
Communication (l nr >fl) saw the critical limitation 1n processing as a slnqle, 
non-sharablo information channel linked to the senses by a selective filter 
and passing its output to memory and response systems. However, evidence of 
concurrent, high-le^el processing of semantic Information from multiple 
speakers led Morav (lff>7) to proposed that capacity nay he shared between 
different processes or tasks. Kahneman (1973) adapted Moray's general 
capacity rodel to an expandable general capacity theory, persuaded by 
physiological evidence that processlnq capacity may Increase with increasing 
demands for that capacity. Kahneman Identified capacity with attentlonal 
processes, a view that is shared by many contemporary theorists. 

These Moas were formalized and generalized by Norman and PiOhrow 
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BEATTY: NEUROPHYSIOLOGY OF PROCESSING RESOURCES 



(l c 7*), who introduced the concept of multiple processing resources in their 
brief treatment of time-sharing behavior: 

i 

"Any information processing device has programs and some mechanism 
for executing those programs. When a program is executed, it 
requires input data and it consumes resource* A set of programs 
that is being executed for a common purpose and for which 
resources are allocated as a unit is called a process. Resources 
are such things as processing effort, the various forms of memory 
capacity, an^ communications channels. Resources are always 
United (p. 45). M 

The lack of convincing behavioral evidence in support of a single processing 
resource has encouraged the development of multiple resource theories (Navon 
and r.opher, 107<\ 1W0; Sanders, 1<79; Wickens, 1979, IPSO). However, the 
persistent difficulty with all multiple resource theories is the 
identification and specification of putative processing resources. 

It would seem, after several decades of research on the problem of 
processing limits that, there would be some Concensus in the field regarding 
the most appropriate type of model, but this ,is not the case. Sanders 
(1979) in Ms recent, thoughtful reviev/ found compelling evidence in support 
of the sinnle channel hypothesis, Kahneman's effort model and multiple 
resource theory; none of these models appears to be completely 
satisfactory. 
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Three fundamental questions appear to confront any theory of processing 
resources: 

1. Is there a single general processing resource 1/ are processlno 
resources multiple? 

?• How can a processing resource be Identified and specified? 

3. Is the processing capacity of a resource fixed or partially 
expandable as Kahneman suggested? 

It appears that these Issues can be resolved only by empirical evidence, but 
the behavioral data presently available are Insufficient 1n themselves to 
provide an acceptable solution. 

A complementary approach to these questions 1s that of hunsn 
neurophysiology. Since the processlnq resources 1n human cognition must be 
functions of the human nervous system, human neurophysiology ma> provide an 
important source of theory and data for the study of processing resources. 
Hunan neurophysiolony, 1n my opinion, may make three types of contributions 
to the development of cognitive theory. First, 1t provides an Important 
source of converging data for the testing of theoretical positions. For 
exanplo, in later sections of this chapter, evidence concerning each of 
three questions mentioned above will be reviewed. In this way, 
neurophvsiologiral data can help resolve some of the more difficult 
questions in cognitive theory. Second, neurophyslologlcal data may provide 
important constraints on the tvpes of cognitive theories that are to be 
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considered. Proposing theories of human cognition that are incompatible 
with what is known about the human nervous system is not qood psychology. 
Finally, neurophysioloqical data mlqht serve a heuristic role in the 
development of cognitive theory, suggesting specific hypotheses that are 
amenable to behavioral testing. A few such suggestions appear later in the 
chapter. Thus, I believe, human neurophysiology may serve as an important 
— and ofter neglected ~ source of information in developinq a more 
satisfactory theory of the structure of human cognitive processing. 

Throe topics of recent neurophysiolcgical inquiry seem relevant to the 
study of processing resources in the human brain,, The first involves the 
use of netaholic mapping techniques to study cerebral energy metabolism 
during information processing in the human nervous system. This new 
literature suggests that procession resources mey be expandable in the sense 
proposed by Kahneman (1973). The second, related literature involves the 
measureront of regional cerebral blood flow during human information 
processinn. These data support the idea that there are multiple, specific 
prccossinq resources and provide heuristic information for their 
identification. The third line of neurophysioloqical research involves 
peripheral indications of reticular core activation durino the performance 
of connitive tasl'S. These data support Kahneman's original effort 
hypothesis and sugqosts a hierarchical model of the structure of processing 
resources. Tonet'ier, these data may serve to clarify the problematic 
questions of human information procession resources, their nature, their 
structure and their functions. 
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2.0 Cerebral Enerqy Metabolism and Human Information Processino 

Although the human brain accounts for only 2 percent of the bodv's 
weight, it utilizes 20 percent of the total resting oxygen consumption of 
the body. If the flow of blood to the brain 1s Interrupted, all available 
brain oxygen reserves are depleted within 10 seconds and unconsciousness 
results. Brain oxygen levels are critical because oxidative qlucosp 
metabolism is the only source of energy for the neurons of the central 
nervous system and processing Information within the CNS requires energy. 

Information transactions within the human brain depend upon the action? 
of larnr populations of Individual neurons, which communicate among 
thenselves by synaptic interactions. Further. Information is communicated 
between different brain regions by spike potentials generated in the 
exciUhle membranes of the cell body and axons. Thus, the Information 
processing functions of the central nervous system may be linked to Its 
energy consumption at several levels (Krnjevlc, 1975): 

1. The sodium-potassium pump. The standing membrane potential which 
provides the basis for all Information processing at the cellular 
level is the product of an active, energy consuming membrane 
procpss that ants to transport sodium out and potassium into thn 
neuron. In doinq so, 1t 1s moving paired 1ons aoalnst both 
concentration and electrical qradients, an action that requires 
significant amounts of metabolic energy. The generation of an 
action or spike potential in a neuron <s effected hy momentarily 
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permitting the exit of potassium and the influx of sodium, which 
creates future work for the sodium-potassium pump. Thus, the 
metabolic load placed upor this membrane pumping system is a direct 
I, :ear function of the firinq rate of each neuron in the central 
nervous system. This is one way in which function and metabolic 
load are coupled. 

?. Synaptic events. Many synaptic events ^ are active neurochemical 
processes requiring energy in their function. For example, 1n 
synaptic endfeet all available adenosine triphosphate (ATP), the 
final enerqy-rich product of cerebral qlucose metabolism, 1s 
completely utilized and replaced every 3 seconds. Metabolic enerqy 
appears to be required for neurotransmitter release, reuptake and 
deactivation. Similarly, active processes are involved 1n the 
response of the postsynaptic membrane to neurotransmitter release. 

3. Synthetic nrocesses. Energy 1s required 1n the maintenance of 
proper supplies nf neurochemicals utilized in cellular Information 
transactions. These include the manufacture of neurotransmitter 
substances, neuromodulators and other molecular neural supplies. 

These and other, similar relations link cellular information processing 
functions with cellular metabolic demands. It is for this reason that 
measures of cerebral oxidative glucose metabolism provide a primary 
indication of the intensity of information procession within selected 
reqions of the nervous system. The nrocess of olucose metabolism begins 



with glucose uptake by neurons as a function of their metabolic demands. 
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This process was once thought to depend primarily upon passive diffusion 
across the membrane, but 1t now appears to involve fast acting, active 
transport channels (Bachelarri, 1975) . These channels are sensitive to the 
molecular structure of glucose, transporting some sugars and not others. 
But most Important 1s the fact that the glucose transport mechanism is 
regulated with respect to rate and may, in fact, provide a urinary mechanism 
for controlling the cellular metabolic rate for glucose. 

The principal metabolic pathway for glucose energy releaf^TnvoTves the 
phosphorylation of glucose to glucose 6-phosphate by the enzyme 
glucohexokinase. The final result of oxidative glucose 1s the production of 
energy-rich ATP. As each molecule of glucose is oxidized, 36 molecules of 
ATP are produced. It is the ATP that directly provides the source of energy 
for the conduct cf information processing functions in neural tissue. 

?.l P-Deoxyglucose Measurement of Cerebral Glucose Metabolism 

Early attempts to measure brain metabolic activity relied upon 
comparisons of blood gases and energy substrates in the arterial supply and 
the vonous return from the brain. The results were unlnformati ve: 
functional aspects of central nervous system activity did not appear to 
affect cerebral metabolic rates. This lack nf metabolic responsivlty gave 
rise to the Idea that brain metabolism is unrelated to brain function, 1n 
much the same way a computer's energy requirements are unrelated to the 
details of the information transactions taking place. 



ERIC 




115 



BEATTY: NEUROPHYSIOLOGY OF PROCESSING RESOURCES 



The problem, of course, stems from the fact that the brain 1s not a 
functionally uniform structure, but is Intricately differentiated. 
Different Informational transactions within the nervous system Involve 
different portions of the brain. Thus, 1f some brain structures were 
Increasing and others decreasing their metabolic demands, little effect upon 
the aggregate arterial-venous differences would be expected. 

It was therefore a finding of considerable Importance when a method for 
accurately measurlnn cerebral metabolic rate for glucose (CMRGlc) with 
microscopic spatial resolution was reported by Sokolov and his collaborators 
(See Sokolov, 1979, for a detained description of this methodology). By 
uslnn a radloactlvely labelled analog of glucose that 1s only Initially 
competitive 1n the glucose metabolic pathway, Sokolov was able to obtain 
photographic Images of brain tissue 1n which Image density 1s proportional 
to the local rate of qlucose metabolism. 

2-d^oxygluc6se (?-DG) 1s an analog of glucose t'iat 1s completely 
competitive with glucose for access to both the membrane qlucose transport 
channels and phosphorylation by qlucohexoklnase. However, the product of 
the phosphorylation, 2-deoxyqlucose-fi-phosphate, does not compete with 
glucose-fi- phosphate for further processing. Thus, when 1s Injected 

into the systemic circulation, a metahollcally active neuron will take up 
?-[)G 1n proportion to Its metabolic rate fnr glucose. But because the 
metabolism of ?-DG is hlockad at an early stage, 2-PG will accumulate in 
neurons and not be returned to the systemic circulation as metabolic 
byproducts. Thus the amount of ?-DG present 1n a neuron is a joint function 
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of thfe amount of ?-DG 1n the general circulation, the length of time 
following Injection, and the average CMRGlc for that cell during the period 
of 2-OG availability. 

The 2-PG method for CMRGlc measurement was applied to the study of the 
human brain by Phelps and his colleagues (Phelps et al., 1979, 1980, 1981). 
Using a fluorine-labelled 2-deoxyglucose (FOG) human volunteers are 
intravenously Injected and placed 1n experimental or control conditions for 
a period of ?0-40 minutes to permit accumulation of the labelled compound in 
metaholically active brain regions. A noninvasive positron emission 
scannijnn device, which is similar 1n many was to the now familiar CAT 
scanner, is used to measure an^ locate the sources of radioactive particles. 
Fron these date, computer procedures are employed to derive density 
tomograms, or computed planes of section throuqh the brain, that reflect th<* 
local CI'RClc. At present, the resolution of the reconstructed Images is 
approximately 1.6 cubic centimeters. It should be noted that the actual 
dose of radiation involved 1s rather small. Furthermore, the effects of 
radioactivity are short-lived, the half-life of fluorine being approximately 
110 minutes. 

2.2 Cerebral Glucose Metabolism 1n the Resting Human Brain 

Determination of the basal rates of cerebral glucose metabolism is a 
necessary first step in the study of modifications of CMRGlc produced in 
specific brain regions during Information processing. Such data have 
recently been published by Mazziotta et al. (1980) for the human brain at 
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rest. 

Mazzlotta and his collaborators employed the FOG tracer and positron 
emission scanning techniques to measure regional CMRGlc 1n 7 neurological ly 
normal male university students under conditions of minimal sensory and 
motor stimulation. F1gur» 1 presents two-dimensional positron scans of the 
upper hndy taken 40 minutes after FOG Injection. 



Insert Figure 1 Here. 



As 1n the tomographic Images, t'»ie darker portions of the picture represent 
areas of hH'ier glucose metabolism. From Figure 1, 1t 1s apparent that 
heart and brain are organs with exceptionally high rates of qlucose 
metabolism. 



Visual Information Processing and Cerebral Glucose Metabolism 

The close relation between higher brain ^unction and brain metabolism 
or work Is clearly Illustrated 1n recent Investigations of the metabolic 
changes occurring during visual Information processing 1n man and animal. 
For oxanplp, Phelps and his colleagues (Phelps et al., 19fl0) examined the 
effects of the complexity of visual stimulation on brain CMRGlc using the 
FOG tracer and positron emission scanning Imagery. The first part of the 
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experiment Involved comparing the effects of different levels of visual 
stimulation 1n a group of l 1 ? neurologlcally normal, young, right-handed, 
male university students. Four conditions were tested 1n different subsets 
of this sample: 1) no visual stimulation produced by closing and/or 
patching the eyes, 2) a brightly Illuminated unpatterned visual field, 3) 
patterned stimulation, a black and white checkerboard pattern phase 
reversing at the rate of 2/sec, and 4) complex visual stimulation resulting 
from walking out of doors 1n a park. 

Using the FDG tracer and positron emission scan Imaging, a series of 
tomograms were obtained that included the full extent of visual cortex. In 
those tomographic sections, major structural features of the human brain can 
be ri1st1nqn1sher\ For example, the gray matter of the cerebral cortex may 
be s^en as an area of highly active metabolism, 1n contrast to the 
relatively hypometabollc white matter that lies beneath 1t. From these 
images, quantitative estimates of CMRGlc were obtained separately for leH 
and right primary (Brodmann's area 17) and association (Brodmann's area 18 
and l<i) visual cortex. 

Visual stimulation significantly increases the rate of glucose 
utilization in those cortical regions known to be primarily responsible for 
processing visual information. Furthermore, the demand for glucose 
Increases as a function of processing complexity. For example, Figure ? 
presents a representative set of tomograms obtained with the eyes closed anr! 
with white Unlit stimulation. 
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Insert Figure 2 Here. 



It may be observed that the only changes 1n CMRGlc between these two 
conditions occur 1n the visual cortex of the occipital lobe. The darkening 
of these areas under stimulation conditions reflects the Increased metabolic 
demands Imposed by Information processing 1n these areas. 



A quantitative summary of these results 1s presented 1n Figure 3. 



Insert Figure 3 Here. 



Relative to the eyes-closed baseline condition, simple white light 
stimulation resulted 1n an Increase 1n CMRGlc of about 12 percent 1n primary 
visual cortex (PVC) compared with about 6 percent 1n association visual 
cortex (AVC). This difference probably reflects the known fact that AVC 1s 
more concerned with processing spatial patterns than Intensity 
discrimination. The addition of patterned stimulation (the alternating 
checkerboards) Increased CMRGlc 1n both areas by approximately 25 percent as 
compared with eyes-closed baseline. When patterned stimuli were presented 
to only onp of the two eyes, there was a 37 percent decrease 1n metabolic 
response 1n PVC and an 18 percent decrease 1n AVC compared with binocular 
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stimulation. The relative magnitude of these decreases prohahly reflects 
the proportion of monocularly driven cells 1n these two areas; that 1s, 1n 
PVC ahout one third of all neurons could be expected to be monocularly 
driven by the unstimulated eye whereas 1n AVC most neurons have binocular 
Input. 

Finally, the complex visual stimulation of the park resulted 1n very 
laroe Increases 1n CMRGlc. In PVC mean CMRGlc Increased by 45 percent and 
in AVC by 59 percent. Local Increases within these areas were as large as 
100 percent of baseline metabolic rate. These data demonstrate that the 
metabolic demands of anatomically and functionally defined regions of human 
visual cortex Increase markedly 1n response to Information processing 
demands. These Increases 1n CMRGlc are not global but are limited to the 
relevant renions of cortex. Information processing in visual cortex clearly 
requires s rain work. 

All of !he changes 1n CMRGlc Induced by visual Information processing 
1n normal young adults were bilateral. No differences were present between 
the right and left cerebral cortices. In contrast, Phelps et al. also 
measured CMRGlc in seven neurological patients with homonymous hemianopsia, 
a condition in which the visual Input to one of the two cerebral hemispheres 
1s absent. This usually results from unilateral damage to the subcortical 
visual projection system. Figure 3 shows that, when compared with the 
normally Innervated hemisphere, the v1 jally deprived PVC and AVC are 
hypomptabollc. CMRGlc for denervated visual cortex is markedly suppressed, 
again illustrating the link hetween neuronal function and neuronal 
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metabolism. 

2.4 Implications for a Theory of Processing Resources 

These <*ata have several Implications for any theory of human 
Information processing resources. First, they demonstrate that selected 
task-relevant regions of cerebral cortex become activated during specific 
information processing tasks. Introducing visual stimulation had no effect 
on CMRGlc 1n the other classical sensory areas or association cortices for 
example. This finding suggests that restricted cortical regions serve as 
processing resources 1n the sense proposed by Norman ahd Robrow (1975). 
Further, the data Imply that cortical processing resources are specialized 
and multiple, a theme that v/111 form the focus of the following section. 

The second fact emerging from the study of cerebral glucose utilization 
during visual Information processing 1s that cortical resources are not 
continuously employed, but only become active when a cognitive task demands 
the services of that region. Thus, cortical resources are unlike computer 
processors, which usually grind their electronic wheels continuously, 
whether or not there 1s any real work to do. In this sense, cortical 
resources are not fixed 1n capacity, but rather are expandable, as Kahneman 
proposed (1°73). 

This concept 1s in accord with the analysis of resource utilization and 
processing costs offered by Navon and Gopher (1979). Navon and Gopher 
suggested that resources may have a cost associated with their utilization 
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and proposed that such a cost might be the consumption of "mental energy." 
The present data suggest that the cost metaphor need not be so loosely 
appl led; the utilization of cortical resources does indeed have a 
significant energy cost, one that is measurable 1n the rate of cerebral 
glucose metabolism. 

Thus, recent Investigations of task-induced changes 1n CMRGlc provide 
support for the Idea of multiple processing resources, expandable processing 
resources and real costs associated with resource utilization, ' 
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3.0 Regional Cerebral Blood Flow and Cerebral Specialization of Function 

^ h second approach to the metabolic mapplnq of reqional cortical 
activation 1n human Information processing concentrates on the 
cerebrovascular effects of the byproducts of neural metabolism. Cerebral 
blood flow 1s locally regulated in a manner that assures an adequate rate of 
oxygen uptake and carbon dioxide removal 1n the local cerebral 
microenvlronment. Cerebral blood flow 1s said to be autorequlated, in that 
1t is determined bv brain metabolic demands rather than systemic 
cardiovascular state. Thus, local cerebral blood flow 1s closely linked to 
the local rate of oxidative metabolism, which 1n turn reflects the local 
level of neuronal Information transactions. The primary controlling 
mechanism for cerebral blood flow 1s carbon dioxide, an end product of 
oxidative metabolism. Carbon dioxide appears to exert a direct vasodllatury 
effect on the cerebral vasculature and the maonitude of this effect is 
lame: increasing blood carbon dioxide by breathinn a r > percent mixture of 
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carbon dioxide Increases cerebral blood flow by 50 percent. A 7 percent 
mixture results 1n a 100 percent Increase 1n cerebral blood flow. Blood 
carbon dioxide levels sets the vasodilatory tone and therefore the basal 
rate of cerebral blood flow. 

The effect of local oxidative metabolic rate on blood flow operates 
against this background. As cerebral Information processing Increases the 
local metabolic rate, the output of carbon dioxide 1n the extracellular 
fluid of the hraln Increases producing a change 1n cerehral pH 1n the 
direction of acidosis. It 1s believed that the increase 1n extracellular, 
extravascular hydrogen 1ons mediates the local vasodilation of the cerebral 
mlcrovasculature, resulting 1n a local increase 1n cerehral blood flow 
(Ingvar and Lassen, 1976). For this reason, measures of regional cerebral 
blood flow (RCRF) may be used to assess the regional involvement of cortical 
tissue 1n human information processing. 

3.1 Measuring Regional Cerebral Blood Flow. 

Radioactive labelling methods provide the basis for most measurements 
of human regional cerebral blood flow, just as they were utilized 1n 
measurlnn CKRGlc. Most measures of RCBF employ radloactlvely labelled 
xenon, an Inert gas that diffuses freely across the blood brain harrier. In 
one version of the method, a bolus of labelled xenon 1s Injected Into one of 
the two common carotid arteries supplylnq the greater part of one cerebral 
hemlspherp. The gas Immediately passes from the blood Into brain tissue, 
where Its presence 1s detected by banks of radiation detectors placed 
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against the scalp. The rate of regional cerebral blood flow 1n the grey 
matter of the cortex 1s estimate*; from the rate at which this labelled gas 
diffuses back from brain to blood and 1s carried off and lost 1n the volume 
of the body. From the first two minutes of this so-called washout curve, 
the rate of RCBF is estimated. If RCBF is high, then the larger volume of 
blood flowing through the region washes out the diffused xenon more rapidly. 
If PCBF is low, the xencn disappears less quickly. By the use of careful 
measurement methods, an absolute rate n . CBF may be obtained. 

Although RCBF and ChuJc measures inilex complementary aspects of 
cerebral oxidative metabolism linked to brain work, the primary advantage of 
the RC P >F methods is that they are much older and therefore have had the 
opportunity to provide a substantive body of knowledge concerning reqlcnal 
cerebral involvement in human Information processing. It 1s to this 
literature that we now briefly turn our attention. 

3.? RCBF in the Restino Brain. 

In the absence of any explicit task the pattern of RCBF 1s far from 
uniform; flow rates for frontal cortex are 40 to 50 percent greater than in 
posterior regions. Ingvar (1976) has termed this resting distribution the 
hyperfrontal pattern and has suggested that 1t reflects a state of planninn 
on the part of the subject. No real evidence is available that clarifies 
the nature of coonitive processing during wakeful rest, but there is little 
question as to the existence of the hyperfrontal pattern. Sincn the resting 
distribution is not uniform, it has become common to characterize 
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task-evoked patterns of RCBF as deviations from the resting hyperfrontal 
pattern (Lassen, Ingvar and Sklnhoj, 1978). 



3.3 Sensory Processes. 

The presentation of simple sensory stimuli 1n the absence of any 
explicit cognitive task results 1n specific Increases 1n RCBF 1n the 
appropriate primary sensory cortex and not elsewhere. Lassen, Ingvar and 
Sklnhoj (1978) report that presentation of loud tones resulted 1n bilateral 
Increases 1n flow over primary auditory cortex. (See Figure 4 for a mapping 
of functional cortical areas.) 



Insert Figure 4 Here. 
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Similarly, Lassen, Roland, Larsen, Malamed and Soh (1977) have demonstrated 
chat llstenlnq to nuslc also results 1n bilateral blood flow Increases 1n 
primary auditory cortex (superior temporal gyrus) and thtt bilateral 
Increases In occipital RCBF accompanied the viewing of simple visual 
patterns. Unilateral tactile stimulation elicits contralateral Increases In 
RCBF over the appropriate region of the sensory-motor strip (Lassen et al., 
1°77). Finally, Fo1t, Larsen, Hattorl, Sklnhoj and Lassen (1980) report 
that stimulation of the median nerve of the arm results not only 1n the 
expected Increased RCBF of the contralateral sensory-motor hand area, but 1n 
the supplementary motor area as well. These may reflect the fact that the 

134 

126 



CEATTY: NEUROPHYSIOLOGY OF PROCESSING RESOURCES 

median nerve 1s mixed, containing both sensory and motor fibers. 



3.4 Motor Processes. 



Several reports have been published demonstrating specific changes 1n 
RCBF 1n motor tasks. Roland, Sklnhoj, Larsen and Lassen (1^77) have shown 
that static contraction of the hand produces Increased RCBF over the 
contralateral sensory-motor hand area, lv contrast, execution of complex 
patterns of finger movement Induces Increased flow both 1n the contralateral 
hand area and 1n the supplementary motor area as well. Imagining the same 
movements resulted 1n Increases 1n the supplementary motor area alone, 
without activation of the contralateral sensory-motor strip. Finally, 
sequential movement of fingers to target positions elicited contralateral 
sensory-motor, supplementary motor area and parietal lobe increases 1n RCBF. 



The role of the supplemental motor area 1n complex movement was further 
Investigated by fVgagozo and Larsen (197?)) using RCBF measures. They found 
that sustained fcot contraction produced Increases only in the contralateral 
sensory-notor foot area, whereas complex foot movements resulted 1n 
Increases 1n the supplementary motor area as well. Similar patterns were 
seen for comolex finger movements. 



Specific regional increases 1n RCBF have also been reported during eye 
movements and visual search. Melaned and Larsen (1977) found that eye 
movements produced bilateral Increases in the sensory-motor face area, the 
frontal eye fields, the supplementary motor area and primary visual cortex. 
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3.5 Language Processing. 

Language processing Involves the differential activation of the 
classical speech areas. These effects often are bilateral. Thus, Lassen, 
Ingvar and Sklnhoj (1978) report that the auditory presentation of simple 
words activates the superior temporal gyrus (auditory cortex) and Wernicke's 
area of both cerebral hemispheres. Complex verbal stimuli produce these 
effects as well as bilateral Increases 1n RCBF over Broca's area In frontal 
cortex. 

The effects of 'non-linguistic 1 or automatic repetitious speaking were 
studied by Larsen, Sklnhoj and Lassen (1978) who tested 18 right-handed 
persons while either counting repeatedly to 20 or reciting the days of the 
week. Focal Increases occurred bilaterally 1n the sensory-motor face and 
mouth area, supplemental motor cortex, auditory cortex and Wernicke 1 s area. 
No increase 1n flow over Broca's area was reported. Perhaps most 
unexpectedly there was a relative Increase in mean cerebral blood flow over 
the right, not the left, cerebral hemisphere. 



3.6 Implications for a Theory of Processing Resources. 

The finding that functionally specialized regions of human cerebral 
cortex become differentially activated 1n different types of Information 
processing tasks provides strong support for multiple resource theories. At 
Its highest levels, the nervous system does not function as a generalized 
pool of processing capacity, although exactly that hypothesis was put 
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forward by Lashley (1029) 1n his theory of cortical mass action. Instead, 
different regions within the cerebral hemispheres are specialized to perform 
qualitatively different Information processing functions. The experimental 
mapping of regional cerebral blood flow reviewed above provides clear 
evidence of this differentiation 1n cortical function. Thus, the RCBF 
literature serves as a source of converging neuroMologlcal data 1n support 
of multiple resource theory. 



Perhaps more Importantly, these data may serve a heuristic function for 
connltlve theory. The problem of resource identification has plagued thp 
development of multiple resource theories, forcing Navon and Gopher (1979), 
for example, into a purely theoretical account that Ignored the question of 
resource specification. The literature on regional cerebral blood flow 
provides a potential solution to the problem of resource identification. 
These data suggest that each of the three major sensory projection areas 
witb their adjacent association areas may be differentially activated during 
information processing Thus, they may constitute putative processino 
resources. Similarly, the motor strip and adjacent premotor cortex pay also 
constitute a cortical resource, hut the distinction between motor and 
somatosensory cortex is far from cle^r. Further, the supplementary motor 
area of prefrontal cortex appears to serve as a coordinating or executive 
reqlon for the execution of complex, voluntary patterned movements. 
Final lv, the interconnected Wernicke and Rroca's areas possibly in 
conjunction with homologous regions 1n the non-dominant hemisphere are 
differentially activated during languaqe processing. These distinctions are 
Important for coqnltive psychology as they provide physiological evidence 
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relevant to the problem of resource identification. If two cognitive tasks 
involve the same cortical regions, they may he thought to invoke the same 
combination of cortical processing resources. By studying the similarities 
in tasks which activate the same cortical regions, major advances may be 
made in transforming multiple resource theories from formal constructions 
that lack substantive content into a powerful and specific theory of the 
structure of human cognition. 

4.n Reticular Activating System and Cortical Processing Resources. 



Considerable attention has been paid to the question of the involvement 
of subcortical structures in the complex information processing functions of 
the human brain. This line of research stems from Moruzzi and Magoun's 
(1949) now classical discovery of the functional properties of the brainstem 
reticular formation. The interaction of cortical and subcortical structures 
in higher cognitive functions has been elegantly summarized by Luria, the 
late Russian neuropsychologist: 



"The maintenance of the optimal level of cortical tone is 
essential for the organized course of mental activHy. (However,) 
the structures maintaining and regulating cortical tone do not He 
in the cortex Itself, but below it, 1n tUe subcortex and brain 
stem. (These structures together are the reticular formation.) 
Some of the fibres of this reticular formation run upwards to 
terminate 1p higher nervour structures such as the thalarjS, 
caudate body, archlcortex and, finally, the structures of the 
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neocortex. (They play) a decisive role 1n activating the cortex 
and regulating the state of Its activity. Other fibres of the 
reticular formation run 1n the opposite direction. (These 
descending fibers) subordinate the lower (brainstem) structures to 
the control of programmes arising 1n the cortex. The higher 
levels of the cortex, participating directly 1r» *hp formation of 
Intentions and plans, recruit the lower systems of the reticular 
formation of the thalamus and brain stem, thereby modulating their 
work and making possible the most complex forms of conscious 
activity." (from Lurla, 1973, pp. 45-60) 

The dynamic interaction of cortical structures and the reticular core as an 
essential feature of ' ^ neurophysiology of cognitive processing 1s also 
enphasizod by Brodal (1981) in his recent authoritative review of the 
neuroanatomy of the reticular activating system: 

"The cerebral influence on the reticular formation cart hardly be 
overrated. Our neneral alertness 1s influenced by words we hear, 
scones »ve see, and processes which require consciousness and 
interpretation of perceptions and which certainly are dependent on 
cortical activity. ..It appears very likely that cortlcoretlcular 
projections are involved 1n these processes (p. 143). 

It is... common experience that Increased attention and alertness 
are accompanied by an increased heart rate and often also other 
autonomic phenomena. This is easily explained by the general, 
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ascending and descending, actions of the activating system (cf. 
axons with dichotomizing, ascending and descending branches 1n the 
reticular formation), (p. 442)." 



The fact that the ascending and descending Influences of the reticular 
activating system share common efferents suggests a method for measuring 
reticular Involvement 1n cognitive processing; the measurement of 
task-evoked changes 1n the output of the descending activation system. 



4.1 Pup1llometr1c Measurement of Task-evoked Reticular Activation 



Electrical measurement of reticular formation activity 1n man 1s a 
practical Impossibility, as this structure consists of a dense network of 
cell hodles and fibers burled In the central core of the brainstem, 
extending from the medulla to the dlencephalon. However, as Indicated hy 
Brodal, the state of reticular activity can be assessed Indirectly from Its 
effect on the autonomic reflexes, which are controlled by autonomic nuclei 
1n adjoining r«g1ons of the brainstem. These nuclei have Intimate 
connections with the reticular core. Of the autonomic functions regulated 
1n this area, the pupillary system appears to be uniquely well-suited for 
the assessment of reticular activity during Information Processing 1n man. 
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Pupillary illation has been used as a primary measure of reticular 
act1v.jt.lon 1n neurophyslologlcal research since the Dloneerlng 
Investigations of reticular function by Moruzzl and his co-workers (see 
Moruzzl, 1972, fn- a detailed summary). There are at least two reasons for 
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this choice. First, the mechanical construction of the 1r1s 1s such that 
small changes 1n either sympathetic or parasympathetic discharge may be 
discerned. Second, the central connections of the Iridic musculature are 
Intimately linked to nuclei controlled 1n large part by the reticular 
activating system. 

Superimposed upon the tonic level of pupillary dilation, orderly phasic 
responses may be decerned during the performance of mental tasks. These 
task-evoked pupillary responses are time-locked to ongoing mental events. 
They are rapid 1n onset, being measurable at latencies of a few hundred 
milliseconds. They are relatively small 1n extent, reaching a maximum value 
of approximately .50 - .70 mm. at high processing loads, and terminate 
within a few hundred milliseconds of completion of cognitive processing. 
The amplitude of these task-evoked pupillary responses appears to be 
quantitatively related to the momentary level of processing demands Imposed 
by a cognitive task. 

In the fifteen years since pupil lometrlc measurements were Introduced 
to modern psychology (Hess and Polt, 1^64), pupillary responses have been 
analysed !n a variety of Information-processing tasks. A number of 
cognitive domains have been Investigated, Including memory, perception, 
attention, language processing and reasoning. In each of these areas, 
quantitative relations between processing da ands and the maqnltude of the 
task evoked pupillary response have been established. The results of these 
experiments are summarized helow. 
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4.2 Mental Arithmetic. 

Mental arithmetic has been used as an example of a complex reasoning 
problem by several Investigators Interested 1n the pup1llometr1c analysis of 
processing demands. Hess and Polt (1964), 1n their Initial and Influential 
article on pupillary signs of mental activity, measured pupillary diameter 
as 5 subjects solved 4 multiplication problems, ranging 1n difficulty from 7 
X 8 to 16 X 23. For each of the subjects and each of the problems, 
pupillary diameter Increased from the moment of problem presentation until 
the point of solution. Across subjects, the percentage dilation was 
perfectly ordered by presumed problem difficulty. 

These results were subsequently replicated by Ahem (Ahem, l"7fl; 
Ahem & Reatty, 1979, 1n press). Three levels of problem difficulty were 
employed, ranging from multiplying pairs of l-d1g1t numbers to multiplying 
pairs of ?-d1g1t numbers. In this task an Initial dilation of approximately 
.15 mm accompanies the encoding and storage of the multiplicand. The second 
and major dilation 'allows presentation of the multiplier and continues 
through problem solution. Both the amplitude and latency of this latter 
dilation Increase as a function of problem difficulty. In the most 
difficult condition, the response appears to asymptote at approximately .SO 
mm. An example of these task-evoked pupillary responses 1s shown 1n Figure 
5. 
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Insert Figure 5 Here. 



4.3 Short-term Memory. 

The demands placed upon short-term memory formed an Initial and 
enduring problem 1n the pup1llometr1c study of 1nformat1on-process1nq load. 
Kahneman and Beatty (19fi6) presented the first pup1llometr1c analysis of the 
process 1 no demands encountered 1n a short-term memory task. Strings of 3 to 
7 digits were auditorily presented at the rate of 1 per sec. After a two 
second pause, subjects were required to repeat the digit string at the same 
rate. Under these conditions, pupillary diameter Increases 1n a linear 
fashion with the presentation of each digit, reaching a maximum 1n the pause 
preceding report. As digits are unloaded from memory during report, 
pupillary diameter decreases with each digit reported, reachlnq baseline 
levels after report of the final digit. The magnitude of the peak pupillary 
dilation during the pause between Input and output 1s an Increasing. function 
of string length. In unpublished work, Kahneman and Beatty observed that if 
the subject were requested to repeat the string a second time Immediately 
after reporting the final digit, the pupil Immediately dilates to the peak 
diameter for that string and then decreases with each digit spoken until the 
entire strlno has been reported for the second time. Beatty and Kahneman 
(1°66) demonstrated that a similar pupillary function 1s obtained when a 
strinn of itoms is recalled from long-term memory for report. 
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Pupillary dilation 1s also determined by the difficulty of the 
to-be-remembered Information as measured by mamory span for different types 
of Items (Kahneman and Beatty, 1966). Three conditions were tested: recall 
of four digits, recall of four unrelated nouns, and transformation of a four 
digit string by adding one to each Item. Steeper slopes were found for the 
more difficult Item strings. 

The capacity of short-term memory for strings of unrelated digits 1s 
approximately seven (Miller, 1956). Peavler (1974) measured the task-evoked 
pupillary response for strings of 5, 9 antf 13 digits. During presentation 
of the strings pupillary diameter Increases as an approximately linear 
function of memory load for digits 1 through 7. At the seventh or eighth 
digit, the pupillary response reached an asymptote; no further dilation was 
ohserved. Thus, the task-evoked pupillary response reflects Increasing task 
demands nnly within the region of that function where adequate performance 
may be maintained. 



4.4 Language Processing. 

Several levels of lanquage processing have been studied 
puplllometrically. At the most molecular level, Beatty and Wagoner (197R) 
used an experimental method developed by Posner and his colleagues (Posner & 
Mitchell, 1967; Posner Boles, 1971) to study the visual encoding of 
single letters. In their first experiment subjects were required to judge 
whether or not a pair of visually-presented letters had the same name. 
Individual letters were presented 1n either upper or lower case type. Thus, 
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two sorts of letter pairs could he judged to be the same by the name 
criterion. If both letters are presented 1n tho same case (e.q. AA or aa) 
only the physical features of the letters need by analyzed to reach the 
correct judgment. If they differ 1n case (e.g. Aa or bB) then, 1n addition 
to analyzing the physical features, a second step of name code extraction 
must be performed. Although the task-evoked pupillary responses were small 
1n this simple task (on the order of .1 mm), the pupil was sufficiently 
sensitive to indicate the extra processing step required for name code 
extraction. Significantly larger responses were obtained for letter pairs 
that differed in case. 

In a second similar experiment, Beatty and Wagoner examined three 
levels of character encoding by requiring the use of a third, higher order 
category for classification (vowels and consonants). Some letter pairs 
could either be physically identical, identical in name, or Identical 1n 
category membership (e.g. Ae or BK). Again, the task-evoked pupillary 
response differentiated the additional processing load required to perform 
the letter-matching task at each level. 

Ahem (Ahem, 1H7P; Ahern R Beatty, 1979) undertook two experimental 
investigations involvinq language processing. The first of these 
experiments examined task-evoked pupillary responses in the perception and 
comprehension of words. Subjects were required to judge pairs of words as 
similar or different in meaning. The first word of each pair v/as drawn from 
either the easiest or the most difficult items of one of three psychometric 
vocabulary tests. The second word, presented two seconds later, was either 
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a synonym of the first, or quite different 1n meaning. A dilation of 
approximately .1 mm followed the presentation of the easy target words, 
whereas the dilation to the difficult target words was twice as large. A 
second dilation followed the presentation of the comparison word, yielding 
pupillary dilations of .30 and .35 mm. respectively during the judgment 
period. 

In the second experiment, Ahem (Ahern, 1978; Ahem ft Reatty, 1979), 
employing Saddeley's Grammatical Reasoning Task (Baddeley, 1968), presented 
sentences of the form H A follows B" or "B precedes A 11 after which an 
exemplar M AB M or H BA H was given. The task was to determine whether the 
sentence correctly described the exemplar. Sentences differed 1n 
grammatical complexity, being act1ve-pos1 tlve, active-neoatlve, 
passive-positive or passive-negative. Although these sentences differed In 
length, sentence duration was held constant by using computer presentation 
of digitized natural speech. In this task, Increasing dilation was, observed 
c 1ng the presentation of the sentence and the exemplar which peaked during 
the decision interval. The amplitude of thest responses averaged 
approximately .40 mm and differed significantly as a function of grammatical 
complexity, with the longer, more complex sentences el1c1t1nq larger 
pupillary responses. 

Wright *nd Kahneman (1971) have also applied pupil lonetMc measurements 
in a sentence processing task. Subjects were presented with rercolex 
sentences jf the form: "The qualified managing director was recently 
sensibly appointed hy the expanding successful company. " When required to 
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repeat the sentence, the task-evoked pupillary response Increased as the 
sentence was presented, and peaked during the retention Interval (3 or 7 
sec), reaching a maximum dilation of approximately .30 mm. 

4.5 Perception and Attention. 

Small but reliable pupillary dilations accompany the detection of both 
visual and acoustic signals at near threshold Intensities. Hakerem and 
Sutton (1966) provided the first pupil lometrlc analysis of processing load 
in perceptual detection. Subjects viewed a uniform visual field upon which 
brief increments 1n luminance could he imposed with the left eye as 
pupillary diameter of the right eye was measured. When the magnitude of the 
intensity increment was adjusted to yield 50?! correct detection, all 
vestiges of the flash-induced light reflex disappeared. A clear pupillary 
dilation of approximately .10 mm was observed if and only 1f a presented 
target was detected. This dilation reflects the processing of detected 
targets. Although of small magnitude, these task-evoked responses to 
detection were highly reliable. 

Beattv and Wagoner (1077) extended Hakerem and Sutton*s finding to 
audition, using weak 100 msec 1kHz sinusoidal acoustic signals presented 
against a hackoround of white noise with a probability of 0.50. After each 
trial, the subject rated his certainty that a target had or had not been 
presented (Green J Swets, 1966). For signal present trials, the magnitude 
of the task-evoked pupillary response was an increasing function of rated 
detection certainty, being largest for certain detections and smallest for 
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certain rejections. 

Task-evoked pupillary responses have also measured 1n perceptual 
discrimination tasks, 1n which a presented stimulus must be compared against 
memory and a judgment rendered. Kahneman and Beatty (1967) reported the 
first study of the pupillary response 1n pitch discrimination. The 
amplitude of the response to the comparison tone varied as an increasing 
function of discrimination difficulty, ranging from approximately .10 mm for 
easy to .20 mm for difficult discriminations. 

4.f» Inter-task, Comparisons. 

The data summarized above consistently Indicate that tasks which place 
large demands on the information processing system, judged behavioral ly, 
subjectively or by an analysis of task requirements, elicit large 
task-evoked pupillary responses. Less demanding tasks elicit smaller 
responses. An intriguing possibility 1s that this pup1llometr1c measurement 
of central nervous system activation associated with cognitive functions 
might provide a common metric for the assessment and comparison of 
information-processing load 1n tasks that differ substantially 1n their 
functional characteristics. 

This possibility 1s strengthened by the finding that the maqnltude of 
the task-evoked pupillary responses during cognitive processing 1s 
Independent of hastHne pupillary diameter over a physiologically reasonable 
range of values (Rradshaw, 1969, 1970; Kahneman and Reatty, 1°67; 
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Kahneman, Beatty and Polla-k, 1967). Therefore the absolute values of the 
task-evoked dilations reported from different laboratories for qualitatively 
different tasks may be meaningfully compared. 



Insert Figure 6 Here. 



Figure 6 presents such a quantitative comparison, giving the peak amplitude 
of the task-evoked pupillary response for each of the tasks detailed above, 
subject only to the constraint that the data are not confounded by the 
effects of overt respondlnn. 

The leftmost panel of Figure 6 presents peak dilations for short-term 
memory tasks. The data for short-term retention of digits are the average 
of the values obtained by Ahern (1°78), Kahneman and Peatty (19*6), 
Kahneman, Onuska and Wolman (1068), and Peavler (197^). The value for 
retention of ^ words 1s from Kahneman and Beatty (1°66). The next panel 
summarizes the literature on language processing. The peak value for the 
letter matching task is the average of both experiments published by Beatty* 
and Uagoner (1Q7P,). Sentence encod1nq-l 1s from Wrlqht and Kahneman (1?71). 
Sentence encoding-? 1s from a sentence repetition task recently completed by 
Beatty and Schluroff. All other values for language processlnn tasks are 
taken from Ahern (l°7fi). Word encoding 1s the response to the presentation 
of the first word in the synonyms judament task. The values for easy and 
hard word matching are -the peak response during the judgment period 
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following presentation of the second word 1n that task. The value for 
grammatical reasoning 1s the average of the four types of sentences 1n 
Baddeley's Grammatical Reasoning Task. 

The third panel presents data from the mental multiplication task used 
as an example of complex reasoning. Only Ahem (1978) has presented 
task-evoked pupillary responses for this task which are necessary for 
comparative peak measurement. Multiplicand storage 1s the amplitude of the 
peak response to the first Item 1n the multiplication task. The other three 
values are the peak amplitudes attained during problem solution. 

The rightmost panel presents data for perceptual tasks. The visual 
detection data are from Hakerem and Sutton (1966) and the auditory detection 
data are from Beatty and Wagoner (1977). The discrimination data are taken 
from Kahnenan and Beatty (1967). 

Several points concerning these data deserve mention. First, the data 
are tolerant of the stringent demands placed upon them 1n comparing absolute 
dilation values across experiments. Usually rescallng of some sort 1s 
required for physiological data to remove Individual differences 1n 
responslvity (Johnson and Luhln, 1972). No such rescallng was undertaken 
here. The data plotted are absolute peak dilations obtained from different 
groups of subjpcts performing a wide range of cognitive tasks under varying 
experimental conditions 1n different laboratories. Second, the data plotted 
In Figure 6 are internally consistent. No abnormally large or small values 
are present. Third, the ordering of these values corresponds closely to an 
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ordering of these asks using other criteria of information-processing load. 
The short-tern memory tasks cover a large range of values, depending on the 
number of items held for recall. Similarly in language processing the 
sentence comprehension tasks yield large pupillary dilations whereas the 
simpler word and letter matching tasks elicit much smaller values. The 
mental multiplication tasks also span a wide ranje of values, each 
appropriate to the difficulty of the particular problem. Fina-iy, the 
perceptual tasks, which behavioral techniques indicate impose negligible 
processing load, are associated with small task-evoked pupillary responses. 
Taken as a whole, these data indicate that the task-evoked pupillary 
response m<w serve as an independent, physiological indicator of the 
momentary level of information procession load imposed upon the nervous 
system by cognitive tasks. 

5.0 Toward a Neuronhysiolopical Theory of Processing Resources 

Several factr have emerged from this review of the neurophysiology of 
cognitive processing. The first is that the cerebral cortex is not an 
undifferentiated structure, but rather that it is functionally specialized 
in anatomically definable renions. This conclusion is supported by bot*. 
measures of regional cerebral glucose utilization and blood flow, each an 
indicator of the local rate of oxidative metabolism lesulting from 
information transactions in cortical tissue. It is further corroborated by 
the i„ ropsychologiral studies of the effects of restricted brain lesions, 
although that literature was not reviewed here. These data aive strono 
support to the idea that at the 'evel of cerebral cortex multiple resource 
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theory provides an appropriate description of the structure of the cognitive 
system. Furthermore, the neurophyslologlcal data provides an Initial 
definition for these mult^le resources that may be confirmed by appropriate 
behavioral tests. 

Second, these cortical resources have been shown to be expandable, in 
the sense suggested by Kahneman (1973). Cortical resources are not 
continuously active, but rather are allocated as appropriate given the 
cognitive tasks facing the person. Furthermore, there 1s a real physical 
cost associated with resource utilization; cortical resources require 
biologically large amounts of energy to process Information. ThU finding 
is in accord with the analysis, of resource utilization and processing costs 
offered by Mavon and Gopher (1979) 1n their microeconomic model of cognitive 
processing. The fact that processing costs are real sugqests that cortical 
resource utilization might be a carefully regulated biological process. 

The third finding is that at the level of the brainstem activation 
systems, there 1s strong support for the Idea of a general processing 
resource as Kahneman (1973) proposed. Across a wide range o-f qualitatively 
different information processing tasks, the amplitude of tHe task-evoked 
pupillary response appears to provide a conslstant and sensitive indication 
of the agoregate demand for cortical processing resources. 

Thus human neurophysiology has provided some answers to the major 
questions faring connitive theory. Put these data also suggest that the 
presently available models of the structure of processing resources may not 
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be comnletely adequate. The problem 1s that there exists compelling 
neurophyslological evldencp for both multiple processing resources at the 
level of cerebral cortex and a single general resource at the level of thp 
brainstem. How are these two sorts of data to be reconciled? One answer to 
this question was suggested by an analogy proposed by Kahneman and Beatty in 
1967: 

"The frequent use of the concept of processing load 1n the present 
paper has been guided by a simple analogy: consider a houseful of 
electrical devices that are variously put 1n operation by manual 
switches or by their own internal governor systems. The total 
amperage demanded by the entire system at any one time may easily 
be read on an appropriate electrical instrument outside the house. 
Processing load is here construed as analogous to an aggregate 
demand for power, and there is ground for the hope Lhat the pupil 
nay function as ? useful approximation to the relevant measuring 
device. (Kahneman and Beatty, 1967, page 104)" 

This analogy suggests a neurophyslological framework for a structural theory 
of processing resources. Specific, multiple processing resources of modern 
connitivp theorv are identified with regionally restricted cortical areas of 
specialized function. Thus, *n addition to behavioral data obtained in 
timesharing experiments (Navon and Gophe-, 1979), neurobiologlcal data may 
also be employed to identify u ~» characterize probable specific orocessing 
resources. Neuropsycholooical data detailing cognitive deficits followlnn 
restricted cortical lesions appear to be relevant (Walsh, l<7fl; Hecaen and 
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Albert, 1978; Hellman and Valensteln, 1979). Tasks which show Impairment 
as the result of a specific, restricted brain lesion may be assumed to draw 
heavily upon at least one cortical resource In common. Similarly, recently 
developed methods of metabolic mapping Including regional cerebral blood 
flow (Ingvar, 1976) and cerebral glucose utilization (Sokolov, 1979; Phelps 
et al., 1979) provide Information concerning the regional Involvement of 
specific cortical regions 1n neurological ly normal Individuals performing a 
variety of cognitive tasks. Thus, the analogy suggests the view that 
restricted, functionally similar regions of cerebral cortex may serve as 
specialized specific processing resources, a perspective that not only is 
compatible with modern multiple resource theory, but provides several 
sources of enrichinn, convorgent information. 

Tho analogy also suggests that the qeneral processing resource may be 
identified with the aggregate demand from these regional cortical resources 
for activation from the reticular core. For reasons outlined above, the 
task-evoked pupillary response may be Interpreted as an Indicator of the 
momentary extent of cortlcally controlled reticular activation elicited 
drnrlno thr pxecution of a particular cognitive process. It should be noted 
that tho pupil is not the only Indicator of reticular activation. Similar 
findings have heen reported 1n both tho autonomic (Kahneman, Tursky, 
Shapiro, and Cririer, 1069) and skeletal-motor (Tuttle, 1924) systems. 
Within the skeletal-motor system, the amplitude of monosynaptic stretch 
reflexes Increases dramatically during cognitive processing, an effect that 
has been attributed to an Increase 1n excitability of the namma-efferent 
syston, Note 3), which is a classical inolcator of reticular outflow 
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(Granit, 1970). Th.s line of argument leads to the conclusion that 
coqnitlve processing elicits dynamic changes in the momentary level of 
activity in the ascending (and descending; see Brodal, 1981) reticular 
activating system in a manner quantitatively related to within-task and 
hetween-task variations in processing load. What is then required is a 
definition of the mechanisms linking the specialized forehrain processing 
resources and the general reticular activating system. 

Thp mechanism by which the reticular core may activate or facilitate 
information processing in cerebral cortex has remained unclear for several 
decades following Koruzzi and Magoun's (1949) discovery of the behavioral 
activating properties of the reticular core. However, recent r»ata from 
Schiebel (1^80) and Skinner (Skinner, 1979) have suggested a solution to 
this problem. They propose that the reticular system functions to modulate 
an inhibitory gating mechanism controlling thalamocortical communication in 
the forehrain. The essence of this suggestion is that reticular activation 
momentarily expands the processing capacity of the forebrain by facilitating 
communication in the intrinsic thalamocortical pathways. Such a mechanism 
is in close accord with the electrical analogy for processing resources and 
processing load originally proposed by Kahneman and Realty (196/) to account 
for the regularity of their pupil lometrlc data. 

This npurophysiolooical framework for a hierarchical theory of 
process i nq resources both provi des a perspect 1 ve for further theory 
development and suggests relevant sources of data from both the disciplines 
of cognitive psycholoay and the neurosciences. Perhaps most importantly, 
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this framework for a theory, even without elaboration, qenerates a number of 
testable hypotheses. Pairs of tasks may be expected to Interfere with each 
other as they compete for cortical resources. Therefore, either the 
neuropsychological literature or the developing regional brain metabolism 
literature may be used to predict pairs of tasks that are most likely to be 
mutually Interfering. This prediction Implies that these neurophyslologlcal 
findings are useful 1n specifying structure-specific (Wlckens, 197?) 
processing resources. Further, since the model 1s hierarchical, one might 
expect that tlmeshared tasks using markedly different cortical resources 
might nonetheless be limited by the total reticular activation available and 
therefore show Interference. Such hypotheses should be testable by 
pupil lometrlc methods. 

The Immediate importance of this approach 1s that it suggests ways in 
which the strong puplllometric evidence for a general concept of processing 
load is compatible with behavioral evidence for multiple, specialized 
processing resources. The resulting model 1s hierarchical in form and may 
provide a means of integrating the several, apparently discordant facts that 
distress simpler models. 

In addition this perspective also may offer longer range advantages. 
The types of theories spawned from a neurobehavloral framework are likely to 
be richer in several ways than are theories developed from more limited 
nerspectives. Such a theory is likely to be relevant to questions posed in 
both thn psychological laboratory and the neurological clinic. Py employing 
a wider range of material in theory construction, unprofitable theoretical 
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branches are more likely to receive early pruning and interesting 
possibilities are more likely to be nurtured rather than discarded. 

Finally, a neurobehavloral framework brings to the question of human 
information processing resources a richness of empirical data that 1s 
commensurate with the complexity of the theoretical problem. 
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Figure Captions 

Figure 1. Two dimensional FDG scans of a human subject 10 minutes 
after Intraveneous injection. High concentrations of FDG m*> he saen 1n the 
brain and heart. The three images are taken three ancles with respect to 
frontal view (60 degrees right, frontal, and 60 degrees left). (From 
Mazziotta et al., 1980) 

Figure 2. Cerebral metabolic rate for glucose in visual cortex 
reflects visual information processing demands. (A) Sketches from actual 
brain slices a* approximately the level of the cross-sectioral tomographic 
images. The blackened areas at the posterior of the brain indicate 
approximate locations of primary (PVC) and association (AVC) visual 
cortices. (R) v etaholic activity in visual cortex 1s low when the eyes are 
closed and patched. (B) Stimulation by white light. Increasing CMRGlc is 
indicate by increasing shades of gray, with black being the highest. Higher 
metabolic rates nay he seen for PVC and AVC (arrows) with stimulation. (P) 
Metabolic response of the visual cortex while viewing a complex scene. 
(From Phelps, KuM and Mazziotta, 1P81.) 

Figure 3. f'ean increase in the glucose metabolic rate of the PVC and 
AVC in homonymous hemianopsia and with increasing complexity of visual 
stimulation. Data are percentage increases over eyes-closed controls. The 
rate of increase for AVC is relatively greater with complex patterned 
stimuli. (From Phelps et al., 1^80.) 



16? 

159 



BEATTY: NEUROPHYSIOLOGY OF PROCESSING RESOURCES 



Figure 4. A schematic view of the left cerebral hemisphere with msjor 
functional areas Indicated. 

Figure 5. Averaged task-evoked pupillary responses obtained 1n mental 
multiplication. Responses for three levels of problem difficulty are 
plotted. The magnitude of the response during the solution period Increases 
as a function of processing demands. (From Ahern, 1973.) 

Fiqure 6. Peak amplitudes of the task-evoked pupillary responses 
obtained in a rage of qualitatively different cognitive tasks, arranged by 
typo of task. Tho pupillary response provides a reasonable ordering of 
tasks on the basis of presumed processing load. See text for rurther 
details. 
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ELECTROPHYSIOLOGICAL STUDIES OF ATTENTION 
IN MAN 



STEVEN A. HILLYARD 



I would like to describe some electrophysiological studies of atten- 
tional mechanisms 1n man. Our goal has been to gain Insight Into the structure 
of attentlonal processes by recording the time-locked electrical fields that 
arise from the bra^n during the processing of sensory Information. 

What most of us 1n physiological research would ultimately like to ac- 
hieve 1s to specify exactly what brain circuits and what brain structures are In- 
volved 1n the processing of Information and 1n selecting among stimulus Inputs. 
For this type of understanding, however, we'll probably have to wait a few de- 
cades, or even longer because we're still very far from being able to localize 
specific Information processing activities 1n the human brain. Scalp electrodes, 
1n particular, provide a very limited perspective cp the underlying patterns of 
neural activity and the brain structures that are active. However, the recording 
of neural correlates from the scalp as markers of underlying attentlonal and cog- 
nitive processes, does, I think, provide Information that will help us understand 
more about the nature of those processes. 

I think that the recording of neural correlates from the scalp can be of 
use 1n several ways. Our general strategy has been to record evoked potentials or 
event related potentials (ERPs), as they're called, from the scalp in carefully 
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structured behavioral circumstances where we have an idea of what perceptual or 
attentional processes are active. Having identified a wave as a marker of a more 
or less well-defined attentional process, we can go on to new tasks and get some 
Idea of the timing of that process in a different processing sequence, and see how 
1t Interacts with other processes that have also been marked with ERPs. Specifi- 
cally, by looking at the latency of ERPs, we can get information on the question 
of which stages or what levels of processing participate 1n attentional selections 
- the old issue of early selection versus late selection. We can also get infor- 
mation about how much processing is accorded to unattended or irrelevant material 
to which the subject is not making behavioral responses. Typically, a stimulus 
situation involves many more stimuli that one can respond to behavioral ly , and the 
behavioral readout provides a very limited view of how a person is processing a 
multi-dimensional stimulus array. By examining the ERPs which are triggered by 
all the stimuli that are being presented, one can gain a more complete picture of 
how each type of stimulus is being processed. So, the ERP technique provides 
another way of looking at perceptual processes in the brain which has advantages 
and disadvantages in relation to purely behavioral approaches; as a long-term goal 
we're trying to bring these two lines of research Into correspondence with one 
another. 
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The first slide shows the general approach, with electrodes placed on the 
head of a typical subject and the raw, amplified EEG shown above. When a warning 
flash and a signal click are presented you don't see very much in the raw EEG, be- 
cause these stimuli evoked ERPs in the brain which are just a few microvolts in 
amplitude while the on-going EEG "noise 11 is of the order of tens of microvolts. 
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v So, 1n order to see these tiny evoked brain signals, we have to use a signal aver- 
aging computer and summate the ERPs to large numbers of signals, typically between 
tens and hundreds of signals. The end result 1s the averaged event-related poten- 
tial, and 1f we just look at the ERP to the click signal, we can see a series of 
potential oscillations, some sixteen consistent waves, each with its own label. 
These waves are quite reproducible from one person to the next, and a number of 
them have been associated with anatomically localized neural activity. Within the 
first ten milliseconds after a click 1s a series of oscillations which can be lo- 
calized to specific brainstem auditory pathway nuclei as the message ascends through 
the brainstem on its way to the cortex (Stockard & Rosslter, 1977; Starr, 1978). 
The wave V at a latency of 6 msec comes from the mid-bra1n, probably the inferior 
colliculus, and wave VI probably from the vicinity of the medial geniculate body. 
Somewhere in the latency range of 10-15 msec the message arrives at the cortex and 
gives rise to a whole series of late cortical waves. It is in the late, presumably 
cortical activity beyond 50 msec where the ERPs associated with selective attention 
are seen. 

Early experiments in our laboratory (Picton & Hillyard, 1974) were designed 
to examine the old descending Inhibition theory of auditory attention (Hernandez-Peon, 
1966) which proposed that attention to non-auditory stimuli resulted 1n a gating of 
auditory transmission in the brainstem via efferent inhibitory pathways. Contrary 
to this hypothesis, we found that the ampi;._de of the brainstem evoked potential 
to clicks did not vary under conditions of attention versus those of inattention. 
Thus, we obtained no evidence for gating of auditory input at the brainstem level. 

The most consistent changes in ERPs with selective auditory attention were 
observed in a broad negative wave which begins at about 50 sec and peaks at 100 msec; 
this wave is termed Nj, since 1t was the first negative wave that was discovered by 
Davis and associates. Most of my talk will be cr cemed with how this negative ERP 
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varies as a function of what a person 1s attending to. The second slide shows the 
standard paradigm that we've been using for a number of year to elicit these nega- 
tive ERPs related to attention. (Hlllyard et aL , 1973, 1978). 
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In this task stimuli such as tone pips are delivered to different sensory 
channels; 1n this case we present high frequency tones to one ear and low frequency 
tones to the other ear. These tones are given 1n random order to the two ears at 
a rapid rate, typically at Intervals of 200 to 400 milliseconds or about th-ee beeps 
per second. This places a very high load of Information on the subject, making it 
very difficult to attend to all the stimuli that are occurring. The subject is 
asked to focus attention on the tones in one ear and to detect occasional targets 
in that ear which are difficult to discriminate from the non-targets. In this type 
of experiment, for instance, 10% of the tones (the targets) might be of slightly 
longer duration and sound a little louder than the non-targets. As the person 
listens to the tones in one ear and tries tfy detect those targets, the selective 
attention effect is seen in the ERP begin- ng at about 60 or 70 milliseconds after 
the tone onset. These are scalp recordings from the vertex, 1n the midline of the 
head. If you look at the ERPs to left ear tones when the left ear is being attended, 
those tones evoke large negative waves; when attention is shifted over the right 
ear, the left ear tones evoke small negative waves. The reverse happens to the 
ERPs elicited by the right ear tones. In general, the N] amplitude is high to 
attended-channel tones and is reduced to unattended tones. In conditions where 
neither ear is attended (reading a book), the Ni has an intermediate amplitude. 

We've come to think of this attention-related ERP 1n terms of the differ- 
ence between the attended and the unattended channels; this difference (between the 
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solid and c tted lines) 1s actually a broa^ negativity that usuaMy bagins before 
the N] peak and extends considerably after that peak. We used to think of this 
ERP as a simple enhancement of the Ni wave, an enlargement of the evoked potential 
that is always elicited by a tone. More recently, however, we've come to th^nk 
of it as a new, "endogeneous" negative wave that's adaed on the Ni of the evoked 
potential under conditions of selective attention (Hansen & Hlllyard, 1980). This 

4 

ERP has recently been called the "processing negativity" by some workers 1n the 
field (Naatanen & Mlchie, 1979), but I usually call it simply the "Ni effect." 

Now, I'd like to give a brief historical ^ume of some of the factors 
which have been shown to modulate this attention effect (reviewed in Hillyard 
et al_. , 1978; Hil'yard & Picton, 1979), In particular, the stimuli have to be de- 
livered at a fairly high rate; if the tones are delivered to the two ears at a 
rate cf one per second or less, there 1s no difference between the ERPs to at- 
tended and unattended tones. It appears that there has to be a sufficient load 
of Information to engage th« subject'^ selective attention. We also attempted to 
find out what kinds of sensory channels could be used to produce this negative 
ERP effect. In the experiment just described, the sensory channels were defined 
as the two ears, with the stimuli ^Iso differing in pitch. We later found that 
similar ERP changes are seen if the channel* are defined simply as two spatial 
locations (tones of the same frequency from two :patial locations) or if the 
channels are defined as two different frequencies presented from the same location 
in space. The N] effect is also present 1f the stimuli are o* the same pitch and 
at the same location but differ in intensity. That is, whc.eve. two channels of 
stimuli differ in any simple physical attribute that can be easily and rapidly 
analyzed, and attention is focussed on one of the channels, the attended stimuli 
will elicit this enhanced process rg negativity. Since this ERP effect had cer- 
tain parallels with Broadbent's (1970, 1971) concept of a "stimulus set" or 




"filtering" type of attention, we suggested that the processing negativity was 
correlated with the additional processing of sensory input after it had been 
selected by a Broadbent style filter or stimulus set mechanism. Another property 
of this ERP effect which 1s consistent with Broadbent's concept is that the neg- 
ativity is elicited by any stimulus that belongs to the attended channel, whether 
it's relevant to the task or not. Even though the subject has to respond to only 
sc-ne stimuli in a channel, all the stimuli belonging to that channel elicit the 
enhanced negativity. 

We've also done experiments in which we looked at the amplitude of pro- 
cessing negativity as a function of how subjects dlstrlbi^e their attention among 
t*o or more auditory channels (H*nk et al_. f 1977). For instance, when subjects are 
asked to divide their attention between the two ears and listen for targets in both 
ears instead of one, the ERP amplitude is intermediate between the attended N-| level 
and the unattended N] level seen during focused attention. Thus, when attention 1s 
focused on one channel, a large amplitude N] 1s elicited by all of the attended stim- 
uli while the ignored channel stimuli elicit a small N] ; when attention is divided 
between the two ears, however, the amplitude is almost exactly intermediate between 
these two values. This Implies that the total output of negativit: over a given 
time period is going to remain rather constant. This constancy o' whe total nega- 
tivity across conditions of focused and divided attention has been replicated by 
several laboratories (Parasurama»i, 1978; Oklta, 1979) and it appears *hat this nega- 
tivity may represent * type of capacity-limited processing system. While this no- 
tion remains speculative, it 1s clear from the ERP data that the processing nega- 
tivity reflects tne allocation of attention to different stimulus channels and per- 
haps (as described below) to different stimulus attributes as well, 

A recent exp^iment done in our laboratory by David Wooes and colleagues 
examined the properties of the selective "filter- 11 which allow us to focus attention 
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on one spoken message 1n a noisy environment (e.g., in the classic cocktail party) 
while ignoring competing voices. In this simplified version of the cocktail party 
experiment, subjects wore earphones and listened to either a male voice reading a 
novel 1n one ear or a female voice reading anotter novel in the other ear. The 
study was designed to asses the specificity of the attentional filtering process 
using ERPs. We were interested in determining the nature of the information which 
was admitted to the attended ear and/or rejected from the inattended ear. The be- 
havioral data on this phenomenon have been the focus of a classic debate. Origin- 
ally, an all-or-none filter was proposed which presumably blocked out the entire 
message in the unattended ear. Lawsor- (1966), however, challenged this concept 
by demonstrating that reaction times »ere equivalent to tones presented to the 
shadowed and unshadowed ears. Indeed, it was shown that certain types of informa- 
tion did get through from the unatt nded channel (e.g., frequency of the speaker's 
voice, after Ireisman; reviewed m Broadbent, 1971). 

In this experiment, we recot Jed ERPs to four different kinds of "probe" 
stimuli that we superimposed on the speech messages in each ear, to see how each 
would be "filtered" by the attentlonal process. These probes were superimposed on 
the speech in either ear at a rate of about one per second. The probes were: (1) 
a tone pip at the fundamental frequency of the speakers voice (about 100 Hz for a 
male ,oice and 200 Hz for the female voice); (2) a tone pip at the second formant 
frequency of the speakers voice (abojt 1000 and 1100 for the male and fenale voices, 
respectively); (3) the spoken word "ah n , and (4) the spoken word "but", the latter 
two in the actual voices of the speakers in each ear. These speech probes were 
produced by computer which made an A-D conversion of the person saying "ah" and 
"but" and superimposed these sound on the continuous speech. 

The subject's task in one condition was to listen to the spoken message in 
one ear at a time and try to remember its content for a later questiV naire. The 
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probe stimuli themselves were Irrelevant. It was a standard speech monitoring sit- 
uation—lister; to this voice and Ignore that voice. In a second condition, however, 
we required the subject to shadow, that 1s, to repeat the message 1n one ear aloud. 
Since the results were very similar for the shadowing and monitoring conditions, t h e 
ERP data in Slide 3 are collapsed over both conditions. We wanted to look at ERPs 
during shadowing 1n particular, because there's a report 1n the literature (Robinson 
& Sabat, 1975) which claims that when people shadow speech on one ear, they completely 
block out information in the other ear; that 1s, stimuli 1n the rejected ear elicited 
no cortical ERPs. We wanted to see 1f we could replicate that effect, which could 
have been caused arti factually by the masking of the unattended sounds by the shadower* 
voice. To avoid this problem of masking by the subject's voice, we presented che 
speech messages against a loud white noise background and had the subject whisper 
the message. 
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The effects of attention on the ERPs to the four different kinds of probe 
stimuli are shown in Slide 3. There's very little attention effect for the funda- 
mental frequency tone; that is, very little difference between the ERPs to probes 
on the attended and unattended channels. There is also little ERP differentiation 
in the H] latency zone for the second formant probe; this 1s a high-pitched tone pip 
which stands out perceptually in both attended and rejected voices and also elicits 
the largest ERPs in both channels. This 1s what one would expect from the behavioral 
data in dichotic listening situations. In contrast, the ERPs to the speech sounds 
that are superimposed on the voices show strong effects of attentlonal selection. 
Both the "ah" sound and the "but" sound elicit a broad processing negativity when 
they occur in the attended ear. 
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The sude shows only ERPs at a central midline electrode, but lateral 
scalp recordings revealed that the second phase of the processing negativity is 
asymmetrically distributed on the scalp, being large** over the left cerebral hem- 
isphere. Such asymmetries are not seen during attention to tone pips or non-verbal 
material (Hillyard & Woods, 1979). This suggests that the processing negativity 
wh.ch overlaps the N] wave reflects post-selection processing thnt is distributed 
in the brain according to the nature of the stimulus material to be analyzed; in 
the case of speech, of course, the left hemisphere should be the more actively 
engaged. 

The main point I want to make with these data, however, is that the "fil- 
ter" that selects one speech message in preference to another has rather specific 
properties; it's not simply admitting all the information within a particular fre- 
quency zone or in a particular ear, but rather it's selecting according to more 
complex properties of the speech sounds. The human speech selection system is a 
fairly sophisticated filter, if you will, tuned to the higher-order, patterned 
properties of the spoken message. Only stimuli having the appropriate sensory 
configuration are admitted for further processing. 

The specificity of attentional filtering in the visual system has been 
investigated by Russ Harter and associates of the University of North Carolina. 
(Harter & Previc, 1978). When attention is focused upon a particular locus along 
a sensory dimension, not only is information preferentially admitted at that locus 
but there is also a zone of loci around the attended point which receive preferen- 
tial processing. That is, if we consider a simple "searchlight" model of atten- 
tion, the searchlight's beam will have a certain diameter or width. Harter's 
group has tried to define this "bandwidth" of the attenticnal channel that selects 
for a particular size (spatial frequency) of a checkerboard. Nine different sizes 
of checkerboards were presented in random order, and the subject was instructed to 
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attend to one particular size and press a button every time 1t occurred. The 
main comparison was between the ERP to the attended size checkerboard minus the 
ERP to that stimulus when it was not being attended. Again, the attention effect 
was a broad negativity, in this case distributed over occipital scalp regions, 1n 
contrast with auditory attention where the Nj wave is more anteriorly distributed. 
Looking at difference waves across all check sizes, Harter 4 Prevlc found that the 
largest negativity was elicited by the attended size, with a progressive decline 
in amplitude for more distant che^ sizes. This function might be considered like 
a "roll -off" curve of the attentlonal selection process, where the ERP for each 
stimulus is plotted as a function of distance from the attended sensory locus, 
Harter' s work illustrates another way 1n which ERPs have been exploited to define 
the detailed quantitative properties of an attentional process. 

The overall picture of how visual attention affects ERPs, however, is far 
from clear. Paying attention to different sensory dimensions like color, position 
in space, orientation, or more complex patterns 1s associated with a different ERP 
configuration in each case. 

This contrasts with auditory attention where (as long as the attended and 
unattended sounds belong to separate channels) the attended stimuli elicit a broad 
processing negativity regardless of the nature of the sounds. Thus, it is conceiv- 
able that the visual system is associated with a wider variety of attentional mech- 
anisms than the auditory system. 

Question : If you were to switch around the relevant stimuli rapidly; 1n other words, 
just give one or two trials, uhen give the subject new instructions or a new signal, 
so that the first stimulus is now irrelevant, would you get a different spectrum of 

negativity? 

Yes. I think you would get a different result altogether. This effect is related 
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tc what Mike Posner was talking about—that sustained attention to a repeating 
source involves differed attentional process than does switching attention on 
every trial. We've made that comparlron in some pilot studies on attending to 
different visual locations; we found that when a subject sustains attention to 
a repetitive flashing light at one location, those flasnes elicit an enUrged 
Ni (N150) component (e.g., Van Voorhis & HUlyard, 1977). In contrast, when we 
did a variant of the Posner paradigm (Posner et al_. , 1978) where the subject 
switches attention on every trial, we didn't get the enhanced negative ERP at 
all. This ERP difference between sustained vs. rapidly switchable atl ntion 
suggests that we are dealing with different varieties of attention, but I don't 
know yet how to characterize these differences. 

Question : I was thinking that 1f you can get set for a long block of trials, you 
can sort of set up a gate and the selection becomes more automatic. In the other 
case, you've got to stop and do a lot more processing for each stimulus. 

I think that's a good interpretation. The data are consistent with that idea. 
There are much larger ERP differences *or sustained attention. However, I'm not 
sure whether these attention effects, which begin at 80-100 msec in the visual 
system are signs cf an early gating process. In the auditory system, the ERP ef- 
fects have properties which are very close to what one would expect from a 
Broadbent/Treisman type of filter. That is, when attention 1s directed towards 
stimuli having a simple physical attribute, all s*1mul1 which share that attribute 
show equivalents large ERPs. The "processing negativity" seems to have the same 
dynamic properties as the Broadbent-style early selection system. But, how early 
is early, I don't know. 

Question : Does the intensity of the stimuli affect the N] wave? 



Yes. Intensity has a strong effect. In general, the size of the N-j wave, the 
processing negativity, is Increased when stimulus intensities are reduced. This 
contrasts with the effect of lowering intensity on the evoked or exogenous com- 
ponents, which become smaller for fainter stimuli. 

I would now like to describe some very recent experiments by Jon Hansen 
in our laboratory, which look Into the question of how attention is allocated to 
the different attributes of mult1-d1mens1onal auditory stimuli. This experiment 
addresses an Issue that Ann Treisman has been working on lately (Treisman et al . , 
1977)--namely, when people attend to a stimulus that has two attributes like color 
and shape, or pitch and location, are those two attributes processed in parallel, 
independently, or 1s each stimulus analyzed from the beginning as a particular 
conjunction of those attributes. In other words, when you're looking for red 
letter N, do you process that stimulus immediately as a conjunction of attributes, 
a red N, or do you process it for its redness and "Nness" attributes separately at 
an initial stage, later conjoining the two attributes ir a subsequent stage (e.g., 

i 

of focal attention). These are two alternative models for what happens when you're 
attending to stimuli that have two attributes, and Hansen's experiments translate 
this question into ErtP terms, making predictions about the waveforms that one might 
expect under each model. 

The experimental paradigm is as follows: there are two sensory dimensions, 
pitch and location, with two levels of each dimension for a total of four stimuli; 
tone pips of two different pitches occurred at each of two locations. The subject 
is wearing earphc es, and high and low pitched stimuli (1500 and 600 Hz) are given 
in random order to both left and right ears. The subject's task on each run is to 
listen to one of these four dual-attribute tones and to try and detect occasional 
occurrences of targets which are of a longer duration. This 1s a very difficult 
task, requiring the subject to discriminate pitch and location attributes as well as 
tone duration ard to press a button when all these attributes are appropriate to the 
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targets. 

The main experimental question was the following: when you attend to one 
of these four combinations of pitch and location, do you process that tone as a 
conjunction of attributes, as a whole "object", or do you process each of the two 
attributes separately? Slide 4 shows some Realized waveforms of how the proces- 
sing negativity might look under each of these alternatives. If attention is fo- 
cused upon a conjunction of pitch-location attributes from the beginning, that 
stimulus should elicit an enlarged processing negativity, whereas each of the 
other three stimuli should elicit a much lower processing negativity. On the 
other hand, if the two attributes are processed independently, the amount of neg- 
ativity elicited by each tone should vary according to how ^any attributes it 
shares with the attended pitch-location combination (Slide 4A, upper tracings). 
A stimulus which fulfilled both of the cue requirements, having the attended pitch 
as well as the attended location, should elicit the largest processing negativity. 
The tones that have one of attended attributes but not the other should have an 
intermediate amplitude, and tones with neither attribute would have the smallest 
negativity. For the hypothesis of Independent attribute processing, the critical 
comparison is whether the ERP difference between tones having the attended attri- 
bute A versus the unattended attribute A varied as a function of whether attribute 
B was fulfilled or not. For instance, for tones of the attended pitch we can form 
a difference ERP between the attended and unattended locations (upper shaded area) 
If pitch and location are being processed independently, we would expect this dif- 
ference ERP to the attended ear minus the unattended ear tones to be the same for 
tones of the unattended frequency as well (lower shaded area). That is, the dif- 
ferential processing of one attribute should not depend upon the presence of the 
other attribute under the hypothesis of independent, parallel processing. 
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One could also consider the possibility of a hybrid of these two models, 
wherein the attributes are first processed independently and then, when the attri- 
butes are combined, the tones come to be processed as conjunctions; thereafter, 
only the attended conjunction stimulus would receive further proc2ss1ng, and the 
other stimuli that failed to meet any one of the attended cue criteria would not 
receive further processing. 

The next slide (5) shows the ERPs to the four stimulus alternatives (top 
tracings); these are ERPs to the shorter-duration, non-targets only. The top trac- 
ing 1s the ERP to stimuli that have the attended pitch and are 1n the attended lo- 
cation (ear). You see that the greatest processing negativity follows these tones, 
and there is quite a bit less processing negativity following each of the three 
other types of tones. That is, if a stimulus fails to fulfill either the pitch or 
the locational criterion, 1t does not elicit much processing negativity. This pat- 
tern of data is most consistent with conjoint processing model, although the fit is 
not perfect. These results are wholly inconsistent with the independent parallel 
processing model, however; this is particularly evident in the fact that waveforms 
3 and 4 are nearly identical to one another in processing negativity. Those ERPs 
are both elicited by tones of the unattended pitch, but one of them occurs at the 
attended location while the other occurs 1n the opposite ear. This suggests that 
when a stimulus is recognized as having the "wrong" (unattended) pitch, there is 
no further selection on the basis of its location attribute. Now, 1f location and 
pitch were processed with total independence, there should be some ERP differentia- 
tion Oh the basis of location even if the stimulus failed to have the attended pitch 
attribute. This can also be illustrated in the "difference ERPs" shows below; dif- 
ference wave 3 minus 4 shows the ERP differentiation between tones in the attended 
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ear minus the unattended ear when the pitch cue is wrong. This is » very small 
difference. If we look at the difference wave for the location cue when the pitch 
cue is fulfilled, however (that's 1 minus 2) we s a large differential of pro- 
cessing negativity. If the two dimensions were processed in parallel, difference 
wave 1-2 should be the same as difference wave 3-4. That 1s, the processing of 
location should not depend on whether the pitch cue was fulfilled or not. 



INSERT SLIDE 5 

It appears, however, that the results are not entirely consistent with the 
conjoint processing model , because the ERP uO the stimulus that has the correct 
(attended) pitch, but the wrong locallon is a bit larger than the ERP to tones 
which do not have the attended pitch attribute. That is, when a stimulus occurs 
at the wrong location, it is still processed to some extent if 1t has the right 
pitch. This suggests that Ditch may have a kind of primacy over location as at 
attribute for this type of selection t dS k. This effect is seen in the difference 
wave 2-4, showing some extra processing negativity following tones of the right 
pitch at the wrong location. 

Question : How much training do the people have or this task? 

Very little, these are mostly college students recruited for their first evoked 
potential experiment. 

s ostion : Is it possible that these dual-attribute discriminations could become 
automated after extensive training at the task and that the N-| component or pro- 
cessing negativity would change as a function of behavioral a..tomization. You 
could train subjects tn discriminate the four alternatives very rapidly, with v-y 
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fast reaction times, and see if the N] components change over that period of time. 

We haven't looked Into the question of learning and automation. I think 
that'd be an interesting problem to Investigate. When behavior becomes automatized, 
do the ERPs reflect that change? 

Question : I think that that's one of the question you've really got to answer, be- 
cause if I understand Treisman's conjecture, 1t 1s not whether people Initially 
process conjunctions, but whether or not they can come to process conjunctions after 
they've had sufficient experience with them. 

Well, Treisman's two-stage model 1s not dependent upon special learning processes, 
as I understand it. But I don't know to what extent these effects are modifiable 
by training. 

Further evidence on how people select multi-dimensional stimuli comes from 
other conditions 1n Hansen's experiment where the discriminablli ty of one of the at- 
tributes was varied. The experiment was Identical to the one just described, except 
that either the pitch or the location dimension was made less discriminable. To 
make the location discrimination more difficult, the interaural loudness balance 
was adjusted so that the tones were localized close together in space Instead of in 
the separate ears. The ERPs to each of these four tones dlstlnguisned by "easy" 
pitch cues (600 vs. 1500 Hz) and "hd*d" location cues (localized close together) 
are shown at the right of Slide 6. The results suggest that paying attention to 
one of the four tones now involves two distinct phases of selection. At about 70 
milliseconds after the stimulus there's an ERP differentiation along the "easy" 
: jnsion; that is, tones that have the appropriate (attended) pitch attribute start 
showing more negativity than do stimuli that have the wrong pitch. The solid and 
dashed lines are the ERPs to the tones having the attended pitch. If you're attend- 
ing to the high pitch tones located slightly to the left of the midline say, and the 




high pitch is easily discriminate from the low pitch, the high pitched tones 
start eliciting more negativity immediately. The ERP differentiation along the 
hard (location) dimension occurs subsequently; the processing negativity to t h e 
attended location becomes clearly larger only after a latency well beyond 100 
msec. After about 400 msec, all three of the irrelevant tones produce equivalently 
small processing negativity. 
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This particular pattern of ERPs, I think, is consistent with a model of a 
hierarchical selection process (idealized in Fig. 4B). Why hierarchical? If you 
look at the lower ERPs, you see that once a stimulus fails to meet the easy criter- 
ion (i.e., has the unattended pitch), there is no further ERP differentiation along 
the hard (location) attribute. Once a stimulus fails to meet the pitch criterion 
for a target there's no differential processing according to what location it's in. 
A two-stage selection model would fit this data, with an initial selection based on 
the easy attribute and a later selection that's based on the more difficult attri- 
bute. Each stage is reflected in the prolongation or termination of the processing 
negativity. Put simply, if a stimulus fails by the easy pitch criterion, it's not 
processed at all for location. 

Of course, the experiment also included another condition where pitch was 
the hard dimension (900 vs. 960 Hz) and location was the easy dimension, and in that 
case, there was a similar ERP pattern (shown in lower left waveforms of Slide 6). 
However, this latter pattern of ERPs suggests, as in the "easy-easy 11 condition, some 
minimal amount of proofing of pitch information even when a tone fails to meet the 
location criterion. This data is again consistent with the idea that tones of the 
attended pitch in the wrong location (ear) receive a little more processing than 
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tones of the unattended pitch. 

In conclusion, what I've tried to Illustrate here is that ERPs can provide 
useful evidence for the analysis of attentional processes, both to measure the tim- 
ing of different stages of processing and, more generally, to classify different 
kinds of selection mechanisms. 

I haven't talked about the P300 component yet, but if we were to look at 
the ERPs to the target stimuli that the subjects responded to, there would be a 
late positive wave at about 300-400 msec latency. The target tone ERPs would 
show an initial differentiation of processing negativity along the easy dimension, 
a second selection of negativity for the hard dimension, and then a final selection 
process by which the person decides it is a target; this latter decision would be 
associated with the positive P300. This ERP pattern suggests that the terminal 
selection of the target stimulus involves a very different kind of attentional 
mechanisms, associated with the P300, from the initial selections based on the 
other physical attributes of the stimulus. I think that the study of ERP configur- 
ations like these, or indeed, psychophysiological configurations in general, can 
help to differentiate among different types of attentional and cognitive processes. 

Question : Does the P300 reflect a highly cognitive type of processing or does it 
have a strong perceptual weighting? 

I'm not sure how to define the boundary between perceptual and cognitive processes. 
You can elicit I- 300s under very simple circumstances. For instance, if a repetitive 
sequence of stimuli is presented, any deviant stimulus that you have to react to dif- 
ferentially (e.g., make a counting response or a motor response) will trigger a P300 
wave. I think that type of task might involve a very simple discriminative system 
without a high level of cognitive processing. Of course, the P300 only occurs sub- 
sequent to a sensory discrimination process and it does depend on memory storage of 
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the relevant alternative stimuli and alternative plans of action. Such processes 
might be considered "cognitive". It 1s clear, inddently, that the P300 1s not 
associated directly with the final motor output but with Intermediate stages of 
stimulus evaluation and classification (see Donchin, 1979). 

Question : I'm surprised at how long the processing negativity lasts, given that 
the tones are so short 1n duration. Why Isn't there a rapid cut-off of negativity 
when duration is judged to be short and thus the stimulus "falls" on this third 
dimension? 

Good question. The durations of the frequent, non-target tones are 50 milliseconds, 
while the targets are 100 milliseconds long. So you're asking why does the proces- 
sing negativity extend out for several hundred milliseconds beyond the tone offsets. 
I would speculate that the duration discrimination (which is fairly difficult) is not 
made immediately, on-line, but occurs as the stimulus 1s being held 1n a sensory 
storage (or "echoic memory", as it's been called). That is, when a stimulus meets 
the proper pitch and location criteria, it is held in sensory memory for a further, 
more sustained analysis of Its other properties, to see whether It's a longer dura- 
tion target or not. The reaction times in this task are quite long, between 500 and 
600 milliseconds. I would speculate that this sustained ERP reflects further pro- 
cessing of a stimulus "image" that 1s being held 1n memory for a longer period than 
Its actual physical duration. 
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Remarks on Attention and Automaticity 
Daniel Kahneman and Anne Treisman 
University of British Columbia 

We shall be discussing > iree topics, with which we have 
recently been concerned, jointly as well as separately. First, 
we cast a backward glance toward the early studies of attention, 
in an attempt to determine whether the newer theoretical 
approaches to this domain can accomodate the observations that 
led to the formulation of earlier theories. Our conclusion is 
skeptical. We then turn to some data recently obtained in our 
laboratory, which suggest that some prototypically "automatic" 
activities in fact depend on the allocation of attention, in a 
manner that casts some doubt on their automaticity. In the 
third part of this talk we question the assumption that 
perception varies along a single quantitative dimension of 
processing adequacy, depth or extent. Perhaps, as several 
authors have recently suggested, the functions of attention are 
better described by a clerical or bureaucratic model. If this 
view is accepted, many current problems, including the issue of 
the locus of selection and the nature of automaticity, may have 
to be reconsidered. 

We have attempted to construct a list of major trends in 
the study of attention in the last decade. We have grouped 
these changes in three families: changes in the popularity of 
research designs, of concepts, and of theoretical notions. The 
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direction of each change, toward increased 
popularity, is indicated. 
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TRENDS IN THE STUDY OF ATTENTION 
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Selection 

Filtering 
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Continuous tasks 
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Multi-stage tasks 
Accuracy measures 
Discrete trials 

■ rch tasks 
expectancy costs and benefits 
RT measures 

THEORY 

Expectancy theories of attention 
Mental life located in LTM 
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The changes ul design have been particularly dramatic. The 
study of attention in the late 50' s and early 60' s was dominated 
by continuous tasks that required sustained attention. This was 
true in the early studies of vigilance, as well as in the 
studies of shadowing in selective listening. The stimuli were 
commonly presented in the auditory modality. The tasks had a 
relatively complex structure, and the quality of performance was 
measured by its accuracy. 

All this has changed. The standard study of attention 
today consists of discrete trials, typically with visual 
stimuli. The tasks involve a small vocabulary of responses, as 
in search or detection. The favored measure of performance is 
reaction-time. The main manipulation of attention controls set 
and expectancy, rather than selective instructions. Many 
studies are concerned in one way or another with measurements of 
the benefits of confirmed expectations and of the costs of 
unconfirmed ones. 

Turning to concepts, we note a marked decrement in the use 
of the notion of selective attention, and of the concept of 
filtering, both as a description of a particular research design 
and as the name of a possible mental operation, in contract, we 
note the recent surge of interest in the notion of automaticity , 
as well as in the concepts of expectancy and priming as 
mechanisms of attention. 

An obvious change in the nature of theorizing about 
attention is the increasing popularity of models that describe 
attention to an object as a state of anticipation, in its 
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modern form, such a model of attention was first articulated by 
LaBerge (1973), and it plays an important role in the views of 
Posner (1978) and Shiffrin (1975; Shiffrin & Schneider, 1977). 
No one could deny the role of sets and expectations as aspects 
of attention, but the models of LaBerge and Shiffrin appear to 
treat anticipation as the sole mechanism of selective attention. 
The main weakness of such a theory is that it cannot explain 
filtering. Consider those ancient studies of selective 
listening, in which the subject was instructed to shadow a 
message presented to the right ear, and to ignore a message 
simultaneously presented to the left ear. The success of 
subjects in this task, even when the messages consist of 
randomly chosen words, cannot be attributed to biased 
anticipations, since the listener does not know in advance which 
words will be included in the relevant message, and is therefore 
incapable of preparing to recognize those words, or to ignore 
the words that will be heard on the irrelevant message. 

The changes in theoretical attitudes and research practice 
shown in our list are highly coherent. The change c*." 
theoretical preferences in favor of expectancy models of 
selective attention was associated with a compatible change of 
research techniques: away from auditory messages, large 
vocabularies, and accuracy measures, toward discrete visual 
displays, small vocabularies, and reaction times. The result of 
these concurrent changes is that the dominant theoretical models 
of today explain the results obtained in the dominant research 
paradigms of today; however, our impression is that they no 
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longer account for the results that had led earlier to the 
formulation of filter theories. 

The loss of interest in the filtering paradigm has been 
associated with a shift toward late-selection models of 
attention, and with increasing interest in the notion of 
automaticity . The link between these developments is that 
late-selection models typically assume that the activation of 
logogens by their appropriate stimuli, is automatic, effortless, 
and unconstrained by mutual interference at the central level 
(Deutsch & D«*utsch, 1963; Keele, 1973; Shiffrin, 1975). 

For reasons that will be developed below, the expectancy 
model of attention and the emphasis on automaticity are highly 
compatible with a view in which mental life is represented as a 
succession of states of long-term memory. This view has perhaps 
been articulated most clearly by Shiffrin & Schneider (1977). 
We call it a display board model of the mind. Imagine the mind 
a^^ a board, in which each point is a light bulb that can be 
turned on, perhaps at several different brightness levels. The 
bulbs correspond to nodes in LTM, and they are connected in such 
fashion that activation may spread along some predetermined 
paths once a particular bulb is turned on. The lights on the 
display board correspond to dictionary units, or logogens. When 
\ our mental life is somehow concerned with a particular word, or 
another such unit of perception or thought, the light that 
corresponds to it will be turned on. In short, the lights are 
the devices by which we represent the identity of objects. A 
light can be turned on by the presence of its proper object in 



ERLC 



198 203 



the stimulus field, so that the board serves the function of 
those displays that tell you who is in the office at a 
particular instant. In a different way (the ditterence must be 
indicated, or else we could not tell fact from fantasy), the 
lights are activated when we think about an object, rehearse its 
name as part of a list, etc. 

The display-board model of the mind is quite powerful, and 
it provides an elegant account of many familiar effects. Let us 
first consider a partial list of phenomena with which it deals 
very well, before turning to other effects with which it deals 
rather poorly. A display-board model does well in explaining 
expectancy and priming effects. An expectancy is represented by 
a light bulb that is already preact ivated; this is done in many 
electrical circuits, especially with vacuum-tube technology. 
When the tube is kept n warm n , a small impulse will suffice -to 
turn it on. The display-board model is also good for explaining 
spreading activation, by invoking the associative connections 
among nodes. If we assume that some connections are inhibitory, 
the model could also accomodate the costs of expectations as 
well as their benefits: a bulb could be selectively de-activated 
(by a competing expectation) so that a larger-than-normal signal 
would be required to turn it on. Thus, the display-board model 
is an excellent device to represent automatic activation. 
Indeed, it seems to have been designed for that purpose. 

Without further elaboration, the board that we have 
described will not be able to simulate successful performance of 
a filtering task, since its only selective mechanism is a bias 
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of expectation, which requires advance knowledge of the relevant 
and irrelevant stimuli t Another difficulty for the 
representation of filtering is that the display-board model (and 
the logogen system or long-term memory of most current theories) 
is organized in terms of responses . The various units of the 
board are accessed or activated by complex conjunctions of 
properties, but they map one-to-one on responses that the 
subject may be required to make. In a system of this type, it 
is simpler to designate a unit by the response to which it 
corresponds uniquely, rather than by any stimulus properties. 
In particular, it would seem natural for attention to be 
directed to classes of units that belong to a given domain of 
responses, such as digits, animal names or French words. 
Contrary to tnis expectation, one of the best-established facts 
of attention research is the superiority of stimulus set over 
response set (Broadbent, 1970; Kahneman, 1973). Later in this 
report, we consider alternative representations of mental 
activity, which can accomodate this observation. 



The Problem of Automaticity 

There appear to be two classes of claims about 
automa ,icity , which can be simply described as strong and weak. 
The strong version is associated with a definition of 
automaticity which was perhaps put most clearly by LaBerge 
(1975), and appears in a slightly different form in Shiffrin's 
work (Shiffrin & Schneider, 1977). It defines a process as 
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automatic if its performance is not facilitated by the 
allocation of attention. Weaker claims are associated with a 
| weaker definition, which states simply that a process is 

i 

automatic to the extent that it occurs without attention. The 
difference, of course, is that a process can be partly automatic 

i 

! in the weak sense, but still be boosted by the allocation of 

attention; in the strong sense of automaticity , a partly 
automatic process is not automatic. 

The fact that many significant cognitive activities are (at 
least partly) automatic in the weak sense is not controversial. 
People may disagree on the degree to which this or that mental 
activity is automatic, or on the importance of the fact that it 
is automatic to this or that degree, but the weak automaticity 
claim could be on the platform of all political parties. 
Indeed, the notion of a filter which attenuates, rather than 
blocks rejected messages is equivalent to the assumption that 
semantic processing :s partly automatic. 

The feasibility of automatic processing, then is not at 
issue, although there may be discussion concerning the 
importance of attention in facilitating different classes of 
mental activity. The strong claim of automaticity, however, 
presents a sharp experimental Question. Indeed, the strong 
claim has the distinction of being refutable. Refutability is 
perhaps overrated, and a psychologist who asks a colleague to 
state a position that can be refuted usually does so in the 
spirit of asking someone to stand still so that one can punch 
him on the nose. On the issue of automaticity, some theorists 
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are obligingly standing still, and we shall take advantage of 
this fact. 

Filtering in , the Stroop Design 

Reading familiar words is often invoked as the very 
prototype of a highly automatized skill. And the Strdop task is 
often invoked as an example of the automaticity of reading. The 
subjects are trying to identify the color of the ink in which a 
word is pr i nted , but meanwhi le the shape of the word 
automatically and speedily activates the corresponding logogen, 
thus causing interference. This appears to be the standard 
account of the Stroop task, and of the link between that task 
and the notion of automatic activation in reading, because of 
the demonstration that subjects read uncontrollably, even when 
it is to their best interest not to do so. We now describe some 
studies that question this interpretation of the Stroop effect, 
and the supposed automaticity of reading that the effect is said 
to illustrate. 

Imagine a display that is tachistoscopically presented. 
The display consists of a square and a circle, which appear 
unpredictably on either side of the fixation point. The words 
RED and GREEN are printed, respectively in the circle and in the 
square. The word RED is printed in green ink, and the word 
GREEN is printed in red ink. Now imagine a display which is 
similar in all respects to the one just described, except that 
the words RED and GREEN exchange places, so that the color in 
which each word is printed corresponds to the meaning of that 
word. Consider a subject who is assigned the task of naming, as 
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quickly as possible, the color of the ink in the circle. The 
correct response is 'Red 7 in both' cases. Will it be made more 
easily and quickly in one of these cards* than in the other? 

The answer that one receives depends very much on who one 
is talking to. A lay person just laughs, because the result is 
intuitively obvious. Not so the attention theorist. Indeed, it 
is not at all obvious how a theory that contains the strong 
version of automaticity can explain a difference between the two 
conditions. Note that the subject is assumed to be fixating at 
the center, so that the quality of the sensory inputs is the 
same for both cards. If reading is automatic, then the logogens 
for 'Red' and Green' must both be activated by the printed 
words, equally on the two trials. Any facilitation or 
interference which is produced by such automatic activation 
should also be the same. 

Several versions of this experiment have been run (Kahneman 
& Henik, 1981). The results clearly favor the lay person over 
the attention theorist. In one of the experiments, neutral 
words were used, as well 4s color words. The results are given 
below. The conditions are identified by capitalizing the word 
to whose color subjects were to respond. The words in the 
display could be neutral, compatible with the correct response, 
or conflicting. The results shown are mean correct RT in msec, 
and percent errors. 
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NEUTRAL-neutral 


906 


(3%) 


NEUTRAL -compatible 


944 


(4%) 


NEUTRAL-c on flicting 


956 


(2%) 


COMPATIBLE- neutral 


858 


(1%) 


CONFLI CTI NG-neut ral 


1108 


(15%) 



The results show significant interference by a color word 
(even a compatible one f in this case), which is distant from the 
area to which attention is directed. However, this effect is 
quite small in comparison to the effect of an incompatible word 
which is physically conjoined with the relevant color. 

Thesa results lend themselves readily to interpretation as 
an example of filtering in a discrete task. As in any ether, 
instance of filtering, several stages of processing are 
involved. The relevant circle is found at an early stage. 
Attention is paid to it. The allocation of attention to the 
circle facilitates all the responses that are associated with 
various aspects of that object. In particular, it facilitates 
the responses that belong to the set of color names, because 
these responses have been primed by the color-naming 
instruction. Thus, there appears to be no control over the 
activation of the lcgogen by the shape of the attended word, and 
in this sense the reading of the word is automatic. This 
automatic process, however, must depend on the allocation of 
attention, since the word in the square produces much less 
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interference. It seems appropriate to ask how automatic an 
automatic process is, if it depends on prior selection? These 
results present an embarassment to theories that claim reading 
to be automatic in the strong sense. A similar conclusion has 
been reached by Prancolini & Egeth (1980), on the basis of 
rather similar experiments. 

The interpretation that we suggest assumes that visual 
attention is especially effective when it selects an input 
(Treisman, 1969) or an object (Kahneman, 1973). When an object 
has been selected, another selective operation must be invoked 
to determine which property of the object will be allowed to 
control responses. In general, the priming of a response 
category is enough to do most of the work of selection, because 
different properties of an object are rarely linked to different 
members of the same class of responses. The Stroop task is an 
exception, of course. It produces interference because an 
irrelevant property of the selected object is strongly 
associated with a response that has been primed by the task. 
The visual suffix effect, and perhaps the auditory suffix effect 
as well, could be interpreted in the same manner: an irrelevant 
member of a relevant group of items is not easily distinguished 
from its relevant neighbors, and causes as much interference 
with their processing as an extra relevant item would do 
(Kahneman, 1973; Kahneman & Henik, 1977, 1981). 

Visual filtering is a robust effect, which we have 
demonstrated in several experiments. In one of the studies in 
this series we presented two words, on either side of the 
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fixation point, ^ne of the words was always printed entirely in 
black, the other was printed entirely in colored ink, or had a 
single colored letter in it. The subject's task <ras to report 
the color of the ink. Here again, Stroop interference was much 
more pronounced when the colored ink and an incompatible color 
name were conjoined than when they were spatially separated 
(Rahneman & Henik, 1961). An interesting result was obtained 
when a single letter was colored. We had expected the control 
RT to be longer in this case than when a whole neutral word was 
printed in color, simply because a single letter is more 
difficult to find than a whole colored word. The expected 
difference was found, and it was significant. In addition, we 
predicted that naming the color of a single colored letter 
embedded in a color name would be associated with less 
interference than naming the color of the entire word. Focused 
attention to a single letter would reduce the probability, or 
the speed, of reading the incompatible word. Indeed, the 
measure of Stroop Interference was 73 msec for the letter and 
159 msec for the whole word. 

The results of these experiments illustrate the concept of 
filtering in visual presentation, and present some difficulties 
for the strong claim concerning the automaticity of reading, and 
for the interpretation of Stroop interference as evidence of 
such automaticity. The major conclusion is that it is essential 
to distinguish selection of inputs, or objects, from selection 
of properties. As we have seen, observers are capable of 
efficient rejection of irrelevant objects, but the irrelevant 
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properties (and perhaps parts) of a relevant object cannot be 
prevented from contacting their traces, and from activating 
irrelevant responses. The distinction between selection of 
objects (or inputs) and selection of properties (or analyzers, 
*n Treisman, 1969) seems salient and fundamental, yet it is 
often ignored in psychological research and theory. The 
difference between objects and properties was lost historically 
when the ambiguous term "stimulus" was adopted in psychological 
theorizing, because it can be said either that the object (e.g., 
a red 0) is the stimulus, or that a property (redness or 
circularity) is the stimulus. In the behaviorist tradition, the 
term was applied to "whatever controls a response". Because 
discriminated responses are controlled by properties (peck the 
key if the cage is illuminated, not if it is dark), it is most 
natural in that tradition to think of stimuli primarily in terms 
of properties, and to ignore the notion of objects altogether. 
This legacy has influenced current studies of information 
processing. It is illustrated by treatments that interpret 
Stroop interference as a failure of selective attention and as 
evidence of the automat icity of processing, in fact, the Stroop 
effect demonstrates that people do not easily select among the 
properties of the attended object. On the other hand, the 
present results add to the extensive evidence from other studies 
of filtering to support the conclusion that irrelevant objects 
can be rejected quite effectively. 

2ib' 
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The Dilution Effect 

We now turn to another set of tests of the notion of 
automatic activation, which we label the dilution effect. These 
tests are addressed a straightforward prediction of the 

strong version of automat icity , as formulated by LaBerge( 1975) . 
If automatic activation of a node or logogen by a familiar 
stimulus does not require attention, then the activation of that 
node should not be affected by the presence of other objects in 
the field. 

A st rone claim of automatic, processing could be supported 
in either of two ways. The first is a demonstration that the 
efficiency of processing a relevant stimulus is unaffected by 
advance knowledge of its nature or location. The second is a 
demonstration that efficiency is also unaffected by the 
concurrent presentation of irrelevant stimuli. The first 
procedure was introduced in LaBerge's classic experiment, which 
attempted to show that the processing of familiar stimuli is 
independent of attention, by demonstrating that such stimuli are 
processed no faster when they are specfically expected than when 
they are not, in contrast to unfamiliar stimuli, which show 
benefits of expectations. A well-known series of studies by 
Shiffrin and his collaborators (summarized in Shiffrin, 1975) 
appeared to make much the same point, by showing that prior 
knowledge which permitted the focusing of attention on a channel 
sometimes failed to facilitate the processing of a stimulus 
presented on that channel, when other stimuli were presented 
the same time. More recent experiments (Duncan, 1980; Posner, 
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1978) using the methods of Shiffrin and LaBerge respectively, 
have raised some doubts about the absence of anticipation 
effects in these designs. However, the absence of display size 
effects in some search tasks provides incontrovertible evidence 
for the automaticity of that operation, at least under some 
conditions (Egeth, Jonides t Wall, 1972? Schneider & Shiffrin, 
1977; Treisman & Gelade, 1980). 

If a search task can be performed in parallel over an 
entire display, then the performance of that task is automatic 
in the strong sense. A filter theorist would say that such 
search tasks are performed by pre-attent ive mechanisms, since 
parallel operation is the defining attribute of these 
mechanisms: Broadbent has commented on the fact that his 
S-system and P-system had been unfortunately labeled, since the 
operations of the former system are parallel and those of the 
latter are serial. Schneider & Shiffrin (1977) established an 
important fact that would not have been anticipated in filter 
theory: with prolonged practice and consistent mapping of 
stimuli to responses, search could be automatized even when the 
targets shared no obvious physical features. It is important to 
note, however, that the response that was automatized in these 
experiments was a simple instrumental response. There is no 
evidence in this work that automatization can occur when a large 
set of stimuli is mapped, even consistently, onto a large set of 
responses. 

The distinction between filtering tasks and search tasks is 
defined by the nature of the mapping of stimuli to responses. 

209 




The tasks of search and detection radically simplify the problem 
of response selection, since the response is completely 
determined as soon as the relevant stimulus has been detected. 
In a filtering task, such as selective shadowing, or report of 
the middle row of a Sperling array, a problem of response 
selection remains aftsr the relevant part of the message has 
been found. It is not enough to find the relevant row, the 
particular sequence of digits must be produced. Reading is the 
prototypical instance of an information-preserving activity, in 
which the richness of the stimulus ensemble is preserved in the 
repertoire of responses. It is therefore of interest to ask 
whether the "automatic" reading which is assumed to occur, for 
example, in the Stroop situation is subject to an effect of 
display size. The strong claim of automaticity allows no 
mechanism that could produce such an effect, except sensory 
interference. 

We have studied this question in a long series of 
experiments. In a typical study, the subjects are shown a 
colored bar centered on the fixation point, and are asked to 
name the color of the bar as quickly as possible. On some 
trials, a single word is presented, some distance above or below 
the bar. The word is sometimes unrelated to the color-naming 
task, sometimes compatible with the response that is correct on 
the trial, and sometimes is the name of another color. As many 
other investigators have found (Dyer, 1973; Gatti & Egeth, 
1978), the presentation of a color name affects the speed with 
which the color of the bar is named. Interference and 
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facilitation are both obtained, although the magnitude of the 
effects is smaller than when the relevant color and the 
irrelevant word are conjoined. The occurrence of Stroop 
interference in this situation represents at least a partial 
failure of selective attention to objects. The relevant color 
bar is presented at the fixation point and the subject is 
encouraged to focus attention on that area; the word is 
irrelevant and its reading is surely involuntary. The purpose 
of the present set of experiments is to determine whether the 
reading of irrelevant words is also 'automatic 1 , in the sense 
defined by the absence of a display-size effect. We introduce a 
minimal variation of display size: on some trials, an extra word 
is presented, on the other side of the relevant color bar. Our 



will affect the amount of interference or facilitation that the 
color word produces. 

We have run this experiment, in diverse variants, with more 
than 100 subjects. The consistent result of these studies is 
that the facilitating effect of a compatible color name and the 
interfering effect of an .incompatible one are both reduced by 
the presentation of a neutral word. We call this result the 
dilution effect, since the impact of a task-related, distracting 
w^rd is reduced by the presentation of another word, which is 
unrelated to color naming. In several experiments we have 
observed that adding a neutral word to the display retards 
color-naming in the presence of a compatible color word (by 
reducing the normal faciiiation which that word would produce) 
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question is whether the presentation of the added neutral word 
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and improves the speed of color- naming, in the presence of a / 

color word which is incompatible with the correct response. The / 

/ 

effects are small (interference may be reduced from 90 to 60 
msec, and facilitation from 60 to 40 msec), but they are highly 
consistent . 

We have compared the dilution effect which is produced by 
adding an extra word to the display to the effect of adding a 
row of X's. Much to our surprise, the effects turned out to be 
nearly identical. In one experiment, a word diluted the 
interfering effect of a conflicting color name by 20 msec, and 
the facilitating effect of a compatible one by 25 msec. The 
corresponding values for a row of X f s were 19 and 15 msec, and 
the difference between the conditions was not significant. We 
have replicated this result ad nauseam. It is solid in the 
context of the Stroop experiment, although we have found other 
situations in which a neutral word causes greater dilution than 
a row of X's. 

The dilution effect presents an embarassment to the claim 
that involuntary reading is automatic and unaffected by 
attention. (We suppose that no one would wish to extend that 
claim to voluntary reading). If the visually presented words 
automatically access their corresponding nodes in long-term 
memory, and if this access automatically activates primed 
responses, the number of words that are simultaneously presented 
in the display should have no effect on the efficacy of this 
priming. However, the valid.cy of our argument depends on our 
ability to demonstrate that the two items do not interact at a 



peripheral level. This turns out to be quite difficult, because 
the notion of peripheral interaction has recently been broadened 
to include various types of feature interactions a f id 
perturbations (Istes, 1972; Wolford, 1975, 1980) , and the 
conditions for these interactions are only poorly specified. It 
seems reasonable to argue that one of the defining properties of 
peripheral interactions is that their magnitude should depend on 
the retinal distance between the interacting elements. In 
several experiments, however, we have found that the distance 
between the diluting word (or row of X's) and the color name had 
little effect on the facilitation or interference produced by 
that name. 

We conclude that the dilution effect provides evidence 
against the notion that involuntary reading is automatic, 
eflortless and independent of attention. However it is 
important to note that the notion of automatic reading assumes 
the automaticity of two distinct processes: (i)the activation of 
the appropriate node; (ii)the spreading of activation from this 
node to other nodes, or to structures that control responses. 
The dilution effect suggests that one of these events is not 
automatic, and is subject to interference by the concurrent 
processing of other stimuli. What is the nature of this 
dilution effect? 

The following possibilities come to mind: 
(i)The dilution effects are due to peripheral interactions that 
reduce the signal-to-noise ratio in the sensory message. In 
this view, dilution is an artifact. We have mentioned some of 



ERIC 




213 



our reasons for believing that it is not. 

(ii) The processing of information is facilitated by the 
allocation of some resource to the relevant region. Dilution 
occurs when these limited resources are distributed among 
several regions or objects of processing, 

(iii) The onset of a stimulus elicits an automatic orienting 
response. (Posner, 1978; Posner, Nissen t Ogden, 1978). The 
presence of several stimuli in the field induces a conflict 
among competing orientation tendencies, and fully effective 
processing is delayed until this conflict is resolved. 

(iv) The accumulation of information in recognition nodes (or 
logogens) is not affected by the concurrent accumulation of 
information in other nodes, but the efficacy of outputs from any 
one node in controlling responses or in activating other nodes 
is reduced by concurrent activity elsewhere in the system. 

(v) Simultaneously present stimuli all evoke response tendencies 
and the appearance of a dilution effect is produced by response 
conflict (Eriksen & Eriksen, 1974). 

To advance the choice among these possibilities, we have 
used a situation that we label the dummy-conflict design , in 
the basic variant of this design, the subject is shown either a 
single word, which appears unpredictably above or below the 
fixation point; or two words, one above and one below fixation. 
The instruction is to read either of the presented words, as 
quickly as possible. An obvir s argument could be made that the 
response to the dual display can only be faster than the 
response to a single word, since the more rapidly processed 




tnember of the pair should control the response. In fact, the 
presentation of two words causes a slight, but highly consistent 
slowing of the reading time: from 521 to 555 msec in a typical 
experiment. Here again, the distance of the words from the 
fixation point does not seem to matter very much. An 
interference effect of 25 msec was still found when the nearest 
contours of the interacting words were 3 degrees apart, across 
the fovea. Furthermore, the effect did not depend on the 
presentation of a second readable word: a word-sized patch of 
random dots on one side of the fixation point caused about as 
much interference as an additional word. 

It is of course not certain that the interference observed 
in the present experiment has the same origin as the dilution 
effect which was discussed previously, but the hypothesi* tha 
the two effects are similar is both plausible and parsimonious. 
The observations made in the dummy-conflict situation permit\the 
elimination of some ot the possible interpretations that were 
mentioned earlier in the context of the dilution effect. 
Hypothesis (i), which attributes the interference to peripheral 
interaction, is not supported by its insensit ivity to retinal 
distance. Hypothesis (v), which attributes the interference tc 
conflict between incompatible responses, cannot explain the 
effects of the dots. Hypothesis (iv), which assumes 
interference between the outputs of different logogens is also 
rejected, for the same reason. The remaining hypotheses 
attribute the interference to competition for limited processing 
resources or to a conflict between orienting tendencies, which 




must be resolved before the processing of a particular region of 
the field can be carried out at full efficiency. We hope to 
test these possibilities in future experiments. 



Concluding Remarks 
We believe that our experimental results provide 
substantial evidence against the position that we have labeled 
the strong claim of automat icity. We found that the involuntary 
reading of Stroop words was affected by the allocation of 
attention to a spatial position, and also by the presence of 
another word, or row of X's. The accuracy and the speed of 
voluntary reading are also affected by the presence of an 
^irrelevant item in the field, even when the distance between 
words makes peripheral interaction unlikely. In our 
experiments, then, reading does not appear to be fully 
automatic. This^oticlusion, however, appears to be in conflict 
With recent evidence of the. potent effects of words which are 
'read' without awareness (Marc>l, 1981). In a provocative 
, Series of experiments, Marcel observed\that words that are 
ynasked by pattern, to the point that observer^ do not know they 
are there, nevertheless can prime responses t^ a subsequent 
target in a lexical decision task, and cause interference or 
facilitation in the naming of a color patch. \ 

In our laboratory, Peter Porster has confirmed a higMy 
reliable (although f rustrat ingly small) facilitating Effect of a 
masked color word on the naming of a subsequent color patch. We 
have both been subjects in the experiment, and both Showed a 
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substantial effect of words that we never consciously 'saw', 
although we knew they were shown on some trials and were 
watching for them. Evidently, there was some reading of the 
unseen words, and it appears very natural to apply the label 
• automatic 1 to this reading. How can this conclusion be 
reconciled with our other results? 

It seems easy to achieve a superficial reconciliation. One 
possibility is that the activation of logogens by subliminal 
stimuli occurs because reading is partially automatic. In other 
words, we can read without consciousness, but perhaps at a lower 
grade level than usual! It may be a mistake to equate attention 
with conscious experience. In Marcel's subliminal experiments 
the subjects were certai.ily attending to the location of the 
'unseen' words. It is possible that attention limits could also 
be shown subliminally ; for example, interference and 
facilitation from subliminal words might be diluted by the 
addition of subliminal neutral words. 

There are some indications that such patching of our models 
may not be adequate. A more fundamental revision will be 
required, if evidence is obtained for a pattern that we label 
strong dissociation . Marcel's results are an instance of a 
dissociation between phenomena that we expect to be link 1. 
They reinforce the conclusions of a large literature on 
subliminal effects, confirming dissociations between awareness 
and other indications that a message has been registered and 
understood (Dixon, 1971). These results are often compatible 
with the notion that different response systems apply different 
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criteria to the same information. In strong dissociation, < 
however, unconscious processing is as precise and refined as 
conscious processing would be, and it is therefore not possible 
to argue that available information did not meet the higher 
criteria required for registration in consciousness, although it 
sufficed to trigger some low-level responses. The conclusion 
from strong dissociation would be that information which is 
available to control some responses is not made available to 
conscious elaboration. The 'Idence for strong dissociation is 
very scanty, but intriguing. • Thus, von Wright, Anderson & 
Stenman (1975) t orted that the emotional responses to synonyms 
of shock-associated words (but not to the words themselves) were 
the same regardless of whether 



attended or on the unattended channel, and Marcel (1981) 
reported some data in which the priming effects of subliminal 
and supraliminal stimuli were of the same magnitude. 
Confirmation and extension of such findings would force many of 
us to reconsider our presuppositions about information 
processing and conscious experience. 

Dissociation phenomena are troubling because they raise 
doubts about the validity of our sense of personal identity. If 
my skin responds to the emotional significance of words that I 
have not seen, do i_ know the meaning of these words? There 
seems to be no good answer to this question. The solution to 
the dilemma may be to revise our criteria for the use of 
epistemic words such as "know", "see", or "understand". In 



these words were presented on the 
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particular, the suggestion has been made by authors in the 
traditions of artificial intelligence (Minsky, 1975), philosophy 
(Dennett, 1978) and experimental psychology (Allport, 1980) that 
we treat the mind as a collectivity of semi-independent 
entitles, rather than as a single one. 

Perhaps we should take as our model of the mind a large 
organization, such as General Motors, or the CIA. Under what 
conditions can such an organization be said to know something? 
Certainly, the organization "knows" a fact if all the 
significant individuals within it know it, as shown by their 
actions. But there are many borderline cases. Does the CIA 
know a fact if one functionary in that organization knows it, 
but has told nobody else, or is believed by no one? Does an 
organization know a fact if the lower echelons act on it, but 
without informing higher echelons that they do so? The 
observation of strong dissociation phenomena suggests that it 
may be as difficult to assign epistemic states to individuals as 
it is to assign such states to organizations. 

It now appears at least conceivable that future discussions 
of attention may be conducted within the framework of an 
organizational metaphor for the mind. It is disconcerting, but 
perhaps also encouraging, that many of the questions with which 
we have been concerned for years --including the question of 
automaticity that was the focus of this paper — will turn out, 
in such a framework, to be slightly out-of -focus. The 
proponents and opponents of the idea of automatic semantic 
processing, just like the proponents and opponents of early 
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selection, shared many presuppositions. In particular, they 
si»ared the idea of a standard path of information processing, 
and the idea that attention operates at one or more bottlenecks 
(or road-blocks) along that path, to select the messages that 
should be processed further, or perhaps to attach to them a 
single value of relevance. While we continue the debate within 
the old framework, we should remain alert to the possibility 
that it could soon become obsolete. 
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THE ROLE OF ATTENTION IN OBJECT PERCEPTION 
Anne Treiantan 
University of British Columbia 



I'd like to suggest a new view of attention — ?~ at least of one role 
it may play in perception. Let me set the scene by flashing the first slide 
briefly. You probably all had an immediate impression of recognizable ob- 
jects, organized coherently in a meaningful framework; a familiar experience. 
Analysis into more elementary sensations is difficult and feels abnormal; 
yet in order to perceive and identify any complex object, we must normally 
register not only its features (shape, size, color, etc.), but also the fact 
that they are conjoined in a particular configuration. We see a rose as red, 
its leaves as gree^ and the vase as grey. This allocation of colors to shape/, 
sizes to orientations, voices to locations, seems immediate and automatic. / 

However, there has been a long controversy in Psychology about how we 
achieve this impression of unitary objects. Do we build the world we perceive 
out of more elementary sensations, or do we directly register wholes $ mean- 
ings, relations? For a time the Association! st or building-block view was 
generally accepted; complex objects were painstakingly pieced together from 
more elementary sensations. Bait it has alternated with strong swings to the 
other extreme, for example with the Gestalt movement, and new again it is more 
fashionable to talk of 'top-down 1 processing, and of 'frames 1 , and to empha- 
size the vholistic quality of immediate perception. Intuition and introspec- 
tion push us catipellingly towards the Gestalt position. We do see people, 
plants, buildings; not colors, edges, and movements. However, the imnediacy 
and directness of r conscious experience are no gurantee that it reflects an 
early stage of information proce jing in the nervous system. We have little 
idea what the relation between subjective experience and neural coding could 
be. This claim can therefore only relate to the order in which we can intro- 
spectively access different levels of representation* We may become aware 
only of the final outcome of a complicated sequence of prior operations. 'Top- 
down' may describe our experience, but for a theory about perceptual coding, we 
need other kinds of evidence. 




Rec it physiological research shows that the early stages of coding depend 
on sp 'alized populations of cells, tuned selectively to discriminate particular 
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properties. These properties seem to be mapped into different areas of the 
brain (Zeki, 1976). If we look at Psychology, we also find evidence that 
at least sane dimensions are 'separable 1 (Garner, 1970); they can be pro- 
cessed independently and in parallel, they can be selectively attended to, 
and they contribute independently to judgments of similarity. Even pheno- 
menological evidence is not unequivocal: The next two slides will show that 
there is scene difficulty in perceiving complex pictures when the context is 
quite unfamiliar. You may, in fact, have registered mainly colors and lines 
rather than meaningful objects. It seems at least possible that our normal 
glimpses out of the corner of the eye in everyday life are equally imprecise 
or ill-defined, without our being aware of it. 

I would like, at least provisionally, to accept that there is some decom- 
position of the visual input into separable parts and properties. This at 
once raises the question how these properties get put together again, and how 
we normally avoid mistakes in the form of "illusory conjunctions". Why don't 
we see a gray rose with red leaves in a green vase? One answer is that in 
natural contexts there are many known constraints limiting which features we 
can sensibly combine: grass is normally green, the sky blue or grey, people 
have noses and tables do not. Perception maps sensory data into expected 
'frames' , selecting the combinations t>f features that make sense. On the other 
hand, in less predictable situations, a rapid glance or a diffuse and inat- 
tentive gaze may in fact generate som* hallucinatory couplings, without our 
always taking the time to check their validity. A friend walking in a busy 
street 'saw' a colleague and was about to address him, when he realized that 
the black beard belonged to one passerby and the bald head and spectacles to 
another. 

So how do we normally avoid making these 'conjunction errors' whenever 
the context leaves some ambiguity? I would like to suggest a new hypothesis 
about the role of attention, which links it directly to this problem of object 
perception. I will state the hypothesis in a fairly extreme form, because it 
makes it both clearer and easier to refute. But I would like, from the outset, 
to stress that it is still a tenative proposal, whose implications I am still 
actively testing and exploring. 
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My suggestion is that we specify which features should be integrated into 
one object by scanning each location serially with focal attention. Any fea- 
tures that we register within the same central 'fixation 1 are integrated into 
a single percept. In other words, focal attention provides the glue which 
puts objects together. I believe there are two distinct levels of process- 
ing; at the first, all the separate features are coded independently and in 
parallel to form different feature maps, using populations of feature de- 
tectors for colors, orientations, directions of movement, spatial frequencies. 
This stage organizes the perceptual world into groups, textures, homogeneous 
areas — candidates for possible objects and events to be identified at the 
next stage using focused attention. It is similar to the preattentive stage 
first proposed by Neisser in 1967, although for me it is also preconscious , 
and there are one or two other differences which I will discuss later. 

Before any features can be consciously perceived, they must be recombined. 
Conscious experience is populated not by disembodied orienations, colors and 
movements, br*- by objects and their backgrounds. Feature registration is 
followed, iien, by a feature- integration stage, where selective attention 
mediates the formation of multi-dimensional objects • Whenever the context 
is unfamiliar, so that we cannot use prior knowledge to match up the features 
of objects, an act of attention is necessary to ensure that correct conjunc- 
tions are formed. We attend to one object at a time; in other words, serial 
processing is necessary. 

I had this idea four or five years ago and since then I have been trying 
to test it. Let us see what predictions it generates. 

1) If an object has a unique feature, we should be able to detect or 
identify it independently of the number of other objects which are present. 
So search for a feature target should be parallel across a display. 

2) If an object has no unique feature and is defined only by a con- 
junction of features, identifying it should require focal attention to each 
item in turn, and should therefore force serial processing. 

3) It seems likely that texture segregation and figure-ground grouping 
are pre-attentive , parallel processes, they should therefore be determined 
only by features and not by conjunctions of features. At this pre- attentive 
level, the theory claims that there are many separate worlds — a world of 
color, a world of shape, a world of movement, all organized along their own 
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particular lines, but not related or unified until attention is brought „ 
bear. Here is one major difference from Neisser's single, global preattentive 
world. 

4) If attention is prevented or overloaded, features will be "free- 
floating" with respect to one another. This means chat illusory conjunctions 
could be formed when -ore than one object is present, by wrongly recombining 
the features of different objects. For example, the red blouse and black hair 
of the woman in the first slide were_ conjoined by one subject to give a red- 
haired woman. - T 

\ 

\ 

5) Without attention, features could also be ^tially." free-floating. 
In other words, we may not know exactly where they are located, although we 
can certainly home in on them very rapidly. This could\*ot be the case for 
conjunctions, if we identify a conjunction, we must fiis\ have located it. 

6) Finally, we should be able to use these phenomena to identify new or 
doubtful features. For example, is closure an elementary feature? if so, it 
should allow parallel search and texture segregation and it should join in 
illusory conjunctions when attention is overloaded. 

I have tried to test these predictions in a number of different paradigms 
and I'd like to describe some of the results. The first experiments test the 
serial search prediction - the idea that attention must be directed to each 
item in turn whenever the conjunction of features is relevant. The next three 
slides illustrate the task, m the first two, the targets to look for are 
either the letter Vera blue letter. ■ kn the third, the target is a green 
•T'. You probably found that the target jumped out' of the background of 
green 'Xs' and brown -Ts- in the first two ckses and required a painstaking 
scan in the third case, unless you were lucki and happened to focus on it 
early, m the experiment, we tested two conditions of visual search, m the 
first, the target was defined by a conjunction of features (green -T-), while 
in the second 4t was defined by two disjunctive features (either »S« or blue). 
Subjects were to press one key if a target was present and the other key if 
no target was present. We measured the time they took. The distractors were 
always green 'Xs' and brown 'Ts'. Notice that subjects had to check two di- 
mensions in both conditions, but needed to check a particular spatial combi- 
nation only in the conjunction condition, if focal attention is required 
for conjunctions, the scan should be serial and self-terminating. We there- 
fore predict (a) that search time will increase linearly with the number of 
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items in the display; (b) that the positive slope should be half the negative 
slope , because on average, when the target is present, subjects will find it 
after checking half the items. For the feature targets, neither prediction 
should hold. Figure 1 shows the results, it shows a flat or non-monotonic 
relation between display size and detection time for the feature targets. 
There is no evidence that subjects scan serially in order to find the blue 
letter or the 'S', targets that are defined by disjunctive single features. 
When the target _is present, it apparently jumps out at you; no scanning is 
necessary. When it is not present, subjects are much slower and do take 
longer, the more items are present. They may scam the display to make sure. 

With conjunctions, on the other hand, we have a very different pattern. 
Both functions are straight lines, linearity accounting for 99.7% of variance 
due to display size. The ratio of positive to negative slopes is 0.43, which 
is quite close to one half. Notice that we cannot explain this different 
pattern of performance by the similarity or confusability of targets and dis- 
tractors, because this should affect displays of one in the same way as the 
larger displays. The difficulty with conjunctions arises only when more than 
one item is present. So far, then, the results agree with the predictions. 

How general is this finding? What happens with practice on conjunction 
search. Can we get rid of the serial scan and automatize the search for con- 
junctions? Can we perhaps set up a new unitary detector for a "green 'T ■ " 
which will allow it to jump out of the display without focal attention. Wte 
ran four of the subjects for seven sessions and two of them for thirteen ses- 
sions and found no evidence that the pattern was changing. The slope ranained 
just as linear on block 13 as on block 1. The slope and intercept both de- 
creased with practice but most of the change came in the first three blocks. 
It is possible that if we had doubled the number of sessions we might even- 
tually have found a change, but there is little sign of one here. It is in- 
teresting to speculate that there may be built-in neural constrairts on which 
properties can be unitized. Perhaps we just aren't built to respond automat- 
ical ly^tQ^ green 'Ts'. 

Let's look at one or two obvious variations of the task. Suppose we make 
the features easier or harder to discriminate. What should happen, if my 
hypothesis is correct? It should maka each of the serial checks for the 
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conjunction target either slower or faster; so it should change the slope re- 
lating search time to display size without changing the pattern of linear 
functions and the two to one ratio of negative to positive slopes. That is 
what we found. We had subjects searching either for a red "0 1 in green "0 1 
and red 'N' distractors, where discriminability was high both for shapes and 
for colors, or for a green »T I in green 'X' and blue 'T 1 distractors, where 
discriminability at the feature level was considerably lower. The search 
rates (see Figure 2) vary quite dramatically (92 msec per item for conf usable 
colors and shapes to 40msec per item for easily discriminate ones). Yet 
they preserve the pattern of serial, self-terminating search. This is im- 
portant because it confirms that the difference between the conjunction and 
the feature conditions is not simply a difference in difficulty. We might 
have argued that conjunction search requires attention only because it is 
harder than feature search. That doesn't seem to be the case. 

Another question: is the serial scan really an eye movement scan rather 
than an attention scan? Is it our mental or our physical eyeball that e move 
successively from item to item? If we use smaller displays more centered on 
the fovea, will we chciige the pattern? The next slides show two new conditions, 
one with densely packed items and one in which they are spread over an area 
which i'j twice as large. Figure 3 shows the results. The scanning rate is 
the same for the dense and the sparse displays. The serial self-termir.ating 
scan is clearly not a function of retinal distance, as it would be if visual 
acuity and eye movements were determining the search time. The results are 
more consistent with the idea that the serial fixations are made with a mental 
rather than a physical eyeball — with our internal att^ntional spotlight. 

A final important question about generality: V, all these findings apply 
only to color and shape conjunctions, or ar* they generally true of separable 
features? For example, could they hold up with local elements of more complex 
shapes — lines, angles, curves, etc. Must we use focused attention to tell 
us, for example, whether the cross-bar in a particular display goes with the 
two diagonals to form an 'A 1 or with the two verticals to from an 'H 1 ? 

I tested this question by choosing sets of letters that could potentially 
give rise to illusory conjunctions, compared to letters that could not. One 
target in the conjunction condition was an 'R' in a background of 'Ps 1 and 
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*Qs'. The 'R' could be made out of the 'P' and the diagonal line of the 'Q'. 
Another was a V T V in a background of 'Zs' and 'Is*. Would these targets 
therefore require a serial scan? Or is the difficulty of search through 
letter sets determined simply by how conf usable t-he targets and distractors 
are? I used as control conditions, search for letters which, taken indi- 
vidually^ were more confusable with the targets: 'R' in 'Ps' and 'Bs' and 
'T' in 'Ys' and 'Is 1 . Figure 4 shows the results: Again, we have linear 
functions and apparently serial search through the letters that I hypothe- 
sized might be at risk for illusory conjunctions, and faster, non- linear 
functions for the others. The ratio of positives to negatives was close to 
1/2 for the conjunctions and much less for the similarity controls (0.27) • 
Again, we have evidence for two separate underlying processes for positive 
and negative decisions with the similar letters, and a single serial process 
for the conjunctions. The order of difficulty reverses when the distractors 
are homogeneous. Target 'R p in 'Bs' is then harder than in 'Qs'. It is only 
when we add 'Ps 1 to both that conjunction errors with 'Q's become possible 
and focused attention becomes necessary. 



The conclusion seems to be that even with such higly familiar stimuli 
as letters, the risk of illusory conjunctions can arise and force the use of 
focal attention to integrate the local features into the correct units. I am 
not claiming that serial scanning is necessary for all sets of letters* Any 
that can be distinguished by single features (e.g., the presence of a curve 
or a diagonal) should potentially be recognized in parallel. I think this 
experiment hatf also shown that similarity between individual items is not the 
most powerful variable determining attention limits in visual processing* This 
is of course relevant to other models (for example, Gardner, 1970), which 
have claimed that perception is based on unlimited parallel channels, and that 
confusion errors arise only at the decision level* 



The next prediction concerns texture segregation. Early detection of 
boundaries is a primary requirement in perception. Before we can identify an 
object, we must segregate it from its background. If texture segregation and 
figure- ground grouping are pre- attentive processes, depending on parallel 
registration across the visual field, the theory predicts that they should be 
determined only by separate features and not by changes in the conjunctions 
of features. So we should group two sets of items easily on the basis of 
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color (e.g., red shape* vs. blue ones) or a simple feature of shape (e.g., 
curved vs. straight), but not on the basis of conjunctions of these properties, 
(red curved and blue straight letters vs. red straight and blue curved letters). 
The next. three slides confirm these predictions. 

The theory actually makes quite a strong and somewhat surprising claim 
about this early perceptual grouping which sets up the candidates for object 
identification. It implies that preattentive organization exists only within 
dimensions — within a color map, within a shape map, within a map of move- 
ments or orientations — and that these maps are related to each other only 
where and when attention is focused. This suggests the possibility that we 
might effectively camouflage an object at this preattentive level by placing 
it at a boundary between two groups, ear v of which shares one of its features. 
We can choose an object which within either group alone would be quite salient, 
and see if adding the second group makes it harder to see. Figure 5 shows an 
example. In order to detect the presence of the red 'X*, attention has to be 
narrowed down to exclude the adjacent red 'Os' and blue 'Xs' and focused on 
the item itself. Yet in either group alone, it would be quite easy to detect. 
If, on the other hand, the target has a unique feature (for example the color 
green or vertical lines) , we would expect detection to be independent of 
grouping. We ran an experiment to test these predictions, and the predicted 
camouflage of the conjunction target was clearly confirmed: subjects took 
135 msec longer to find the conjunction target than the feature targets. In 
fact, they missed it altogether on 9% of trials, evan though the display re- 
mained on until they responded, what seems to happen is that two competing 
ways of grouping this display exist — one within the preattentive color map 
and one within the preattentive shape map. The conjunction target exists in 
neither of these maps, while the feature target is always unique in one of 
the two. Even when we place the targets in the center of a group instead of 
at the boundary, it still takes longer to detect the conjunction than to de- 
tect the feature target. The presence of distractors elsewhere in the display 
which share the locally distinctive feature of the conjunction target forces 
us to narrow attention <?c*m, at least to exclude these irrelevant distractors. 
Luckily in normal life the preattentive boundaries of our multiple feature 
worlds are likely to agree. The features of real physical objects have 
highly correlated spatial boundaries. The edge 5/ of the dog co-exist and move 
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together, whether we define them by their color — brown, or by their tex- 
ture — furry. 

What is the next prediction? I have been claiming that we cannot iden- 
tify a conjunction of target features without locating it in order to foqus 
attention on it. This need not be true of simple features. For these wef may 
be able to detect their presence or identify ttem without first locating them. 
In fact, the occurrence of illusory conjunctions suggests that features could 
b* to some extent free-floating, not precisely pinned down or labeled with 
their spatial locations. In order to see boundaries between areas with dif- 
ferent features, we need to locate the position of the discontinuity. Buc 
within any perceptual group we may not know, at the feature level, where an> 
particular item is. Of course we can rapidly find its location when we have 
detected itF presence. But my hypothesis was that this might require an 
extra operation, which might take a measurable time. 

I set up an experiment to test this idea, simply by looking at the de- 
pendencies between the two types of judgment, identification and localization. 
If we are unable to identify conjunctions, without locating them, the depen- 
dency ought to be complete, in other words, we should never identify a con- 
junction without also knowing where it is. This should not be the case for 
features, if locating them is actually a separate operation from identifying 
them. The task required subjects to make a forced choice identification of 
which of two targets was presented in an array of 12 colored letters. In the 
conjunction condition, the targets were a retf 'O 1 or a blue 'N' and the dis- 
tractors were red 'Ns' and blue 'Os 1 . In the feature condition, the targets 
were an 'S' or an orange letter with the same distractors as in the conjunq- 
tion condition. We set the exposure duration (followed by a mask) to get about 
80% correct identification of the target in each condition. We then asked 
subjects not only to identify but also to locate the targets, by writing their 
answer in a matching grid. We found that subjects were very unlikely to 
identify the conjunctions without also knowing where they were, while this 
happened on about 40% of trials with feature targets. We got the same result 
when we matched the exposure durations instead of matching the error rates. 
Again, we have evidence that detection of a feature can occur not only with- 
out focused attention, but also without information about its spatial location. 
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At tome level we may have "free-floating" presence or absence information for 
features but tot for conjunctions. 

Let me sunn arize the conclusions so far. I think we have found some 
support for most of the predictions I made earlier. I would like to empha- 
size the importance of their convergence rather than putting too much weight 
on any one alone. It seems that we cah identify separable features in paral- 
lel across a display. We do this separately within a minber of independent, 
pre-attentive feature maps. Relating any individual feature to other features 
of the same object requires an additional operation. This feature registra- 
tion can mediate texture segregation or figure ground grouping, aftd can lo- 
cate potential objects to be identified by a serial scan with focal attention 
at the next level up. Identifying conjunctions of features on the other hand, 
requires focal attention and serial scanning of locations; it therefore cannot 
mediate texture discrimination. This finding is largely independent of 
spatial density, difficulty and practice, although these variables may change 
the rata of serial scanning. The conclusions seem to apply not only to fea- 
ture values on different dimensions (color and shape) but also to local 
elements of shapes, lines, curves, and angles). Thus we have converging 
evidence from a nvmber of different paradigms and stimuli all of which so 
far supports the theory. 

However, you may have noticed a rather glaring omission. I didn't ven- 
ture to test the central claim of the theory — that illusory conjunctions 
would occur if we prevented focused attention. The reason I hesitated, of 
course, was that I didn't really believe they would occur. Nevertheless, 
shame eventually began to prevail over my fears, and I decided I could no 
longer put off doing the most obvious experiment. So we devised a pilot ex- 
periment to try at least on ourselves in the privacy of the testing room. 
We used displays like those in Figure 6. The primary task was to attend to 
and report the digits; we then attempted to write down all we had seen of 
the colored letters. We stopped, discouraged after a few trials, convinced 
we were seeing correctly the one or two items that we managed to report. Each 
of us found it hard to believe she had written down almost as many illusory 
conjunctions as correct items. Having clearly seen a pink 'T', we were re- 
luctant to accept the evidence on the card which contained a pink 'X 1 and a 
green 'T 1 . I tell this anecdote because it is important to distinguish 
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whether I'm describing just wrong guesses, memory failures, or whether these 
responses reflect genuine perceptual experiences. We certainly believed we 
had "seen" at least some of the illusory conjunctions rather than guessed 
than in the absence of information. I would like to claim that what we are 
studying reflects the way we construct our mental experiences of the outside 
world. 

We tested other subjects on the same displays, to see whether they 
replicated our results, asking them to report only what they saw or were rea- 
sonably confident they saw — not to guess. We found that most did make a 
large number of conjunction errors, in fact almost as many as correct reports. 
They averaged about one illusory conjunction in every two trials. If we 
compare these errors with those in which they got one feature correct and the 
other a feature intrusion which had not appeared on the card, we find they 
were three times as likely. Again, it is worth mentioning another anecdotal 
observation, in relation to the question whether subjects were guessing or 
'seeing' the illusory conjunctions: several subjects stopped after a few 
trials and spontaneously made comments like 'Oh, you are tricking me. The 
numbers were colored that time'. No "demand characteristics" or response 
bias will explain that unrequested observation, since we had told subjects ^ 
that all the digits were black. 

The next question we asked was whether there are any constraints on these 
illusory reco\*olings. For example, can we take the red from a small, outline 
circle and use it to fill in the area of a larger triangle that was origin- 
ally blue? Or are there limits to the sizes, shapes, areas and distances 
^ between which the exchanges can occur? The letters in the first experiment 
were the same size; did this encourage the promiscuous mixing of colors and 
shapes, or would we get it as freely with more heterogeneous displays? We 
tried displays in which we deliberately varied the color, size, shape and whe- 
ther the color was filled in or outlined (as in Figure 7). Some displays 
varied in only two features, some in three ard some in all four. 

We also ran two different attention conditions. It is important to the 
theory that illusory conjunctions should result from attention failures, and 
not from any other form of difficulty. We therefore compared this divided 
attention with a focused attention condition. In the divided attention 
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condition , subjects attended to and reported the digits as their primary 
task, and were cuec* only after the display was presented which colored shape 
to report. In the focused attention condition, n the other hand- the digits 
could be ignored, and subjects were given a spatial cue 150 msec in advance of 
the display, telling them which colored shape to attend to and report. We 
matched the overall accuracy in the two conditions by reducing the exposure 
duration in the focused attention condition until the stimuli were hard to 
see. To summarize the results: (1) first in the divided attention condition, 
all four features were liable to switch. Subjects reported illusory exchanges 
of size and outline versus filled as well as of shape and color. All four di- 
mensions seem to be separable by this criterion. (2) Illusory conjunctions 
were as frequent with heterogeneous displays as with more homogeneous ones. 
Colors were as likely to switch between items which differed cn the other three 
features as between items which were otherwise identical. (3) Illusory con- 
junctions do seem to be linked to attention load rather than to discrimina- 
bility or task difficulty in general. When we changed from post-cued, divided 
attention to pre-cued, focused attention, we found a big shift in the type of 
errors. With attention cued in advance and no primary load, subjects made 
very few conjunction errors; either they omitted the item altogether, or 
they were a/* }.i v 3ly to report a feature that was not presented as a feature 
from the wro:ig location in the display. 

These last three experiments have confirmed the initial premise of the 
theory. To a first approximation, each feature seems to be coded as an in- 
dependent entity; it can migrate without constraints from its source or desti- 
nation, if attention is diverted elsewhere. An object can be as confidently 
seen when its conscious representation is generated from the color and size 
of one object and the shape and filled in property of another as when it 
accurately matches the features of a physically presented stimulus. Features 
are exchanged as freely between objects which differ maximally as between 
otherwise identical objects, even though ^his usually requires a change in 
the conscious representation * f the migrating featrre. For example, moving 
a color betv^en objects of different sizes or between one outline and one 
filled object must also change the amount of color perceived. 

The implications of this conclusion, if we accept it, are quite far- 
reaching. II suggests that the internal representation, on which conscious 



experience depends, contains discrete labels of values on each dimension 
separately rather than a wholistic, interactive record. The whole object 
must be resynthesized from a set of these discrete feature labels, which may 
have been accidentally interchanged. If in a brief glance only the labels 
'blue 1 , 'small 1 and 'triangle' are registered, we sipply our conscious image 
with the correct quantity of blue coloring t. fill the specified area, re- 
gardless of how much was initially presented. This hypothesis involves an 
extreme interpretation of the notion of feature separability, and places con- 
scious seeing at a greater remove from the physical stimulus than we would 
intuitively assime. I believe the Gestalt psychologists were right in claim- 
ing that consciousness is peopled not by disembodied features but by objects, 
their backgrounds and their interactions. But in terms of the operations 
which mediate this perception, the elements which code the accessary infor- 
mation appear to be abstracted from their contexts; discrete and independent 
labels which require attention for their correct resynthesis. 

The last question I want briefly to touch on is one that I'm sure has 
occurred to many of you as you listened. So far, I have talked as if we knew 
quite clearly which properties count as separable features and which have to 
be put together as conjunctions. Jrufact, of course, this is far from ob- 
vious, and is, I believe, ari^em^Irical question. My claim has been that 
attention is needed whenever we have no population of specialized detectors 
which directly sense a particular feature in the display. Dees this put me 
into a vicious circle in which I define a separable feature as one which re- 
quires attention before it can be correctly integrated with others? One 
answer is yes, but it is not the whole answer. The escape from the circle 
is through a variety of converging tests for separability, all of which, we 
hope, will pinpoint the same candidates for separable feature- hood. The stra- 
tegy is to choose two features which are most likely to be separable — for 
example we might use color and line orientation — to establish across a 
variety of tasks two different behavioral syndromes typical of features on 
the one hand and conjunctions on the other. So, just as a physician uses 
spots, a fever and a sore throat to diagnose measles, we can then use the fea- 
ture syndrome — parallel search, location errors, illusory conjunctions, tex- 
ture segregation — as new diagnostic criteria for separability to add to 
those proposed by Garner. We may then be able to apply these tests to help 
us decide, for example, whethe more dubious features like closure or 
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intersection or symmetry qualify as elementary features, whether faces are 
recognized as unitary wholes or built out of features, and so on. We can also 
look at perceptual learning to see whether new detectors can be set up, regis- 
tering conjunctions of previously separable features as unitary wholes • 

With color and shape, there may be built in constraints on unitization. 
For component parts of shapes (e.g., curves, lines, angles, etc.) it would 
make biological sense to allow some flexibility. We want to be able to develop 
automatic detection of important faces and places. Another difference between 
color-shape conjunctions and shape component conjunctions is that the latter 
may produce emergent features (Pomerantz et al. , 1977). For example, con- 
joining L with \ produces a triangle hk , which has a new property, closure. 
Are these emergent properties also picked up by separate populations of fea- 
ture detectors? If so, they could affect the probability of illusory conjunc- 
tions. People might be less willing to see a A given an incorrect conjunction 
of L and \ , simply because the closure feature was missing. The more salient 
an emergent feature is, the more its absence should inhibit the formation of 
an illusory conjunction. One test of this question is to compare two con- 
junctions of the same features, the diagonal line and the right angle — one 
which generates closure and one which does not ( bk versus \£ ) . We are finding 
more illusory conjunctions with the arrow than with the triangle (24% vs. 15%). 
Closure may be more salient as an emergent feature than intersection. But 
this generates another prediction: if there is an emergent feature of clo- 
sure which is picked up by a separate population of detectors, it should also 
behave like a feature in the other tests: it should allow parallel search 
and mediate texture segregation. So search should more often be serial with 
arrows than with triangles (see Figure 8) . This prediction is confirmed. The 
slope of the search function was steeper and more linear with arrows, and the 
mean ratio of positive to negative slopes was close to 0.5 for arrows (0.47) 
but not for triangles (0.22). Texcure segregation was also slower with arrows 
than with triangles, averaging 990 msec versus 786 msec. So, across stimuli, 
the tasks seem to covary as predicted. 

Another prediction is also possible: There were quite marked individual 
differences, particularly with the arrows. This suggests the possibility that 
different individuals may give different weight to different features in de- 
termining whether to 'see 1 an arrow or a triangle. For some people, three 
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lines in the right arrangement mi^ht be enough, while for othere, closure may 
also be essential. This gives us another way of testing the theory; of seeing 
whether the different tasks I've discussed are related in the way I suggest. 
Just as different stimuli may generate performance patterns which covary 
across the tasks, so may individual subjects with any given stimulus. People 
who make illusory arrows out of their parts should also search serially for 
arrow targets in conjunction distractors, and should have trouble segregating 
textures which contain arrows or their parts. On the other hand, people who 
code arrows by some emergent wholistic property, for example the three-way 
intersection, should find an arrow target in parallel across search displays 
and should be able to use the presence or absence of arrows to segregate one 
area from another. 

We tested these correlations across tasks within each type of stimulus. 
The findings were that for the arrows, there were significant correlations 
within individuals between (a) the frequency of illusory conjunctions, (b) 
the difference in search slopes for arrow targets in a conjunction and in a 
feature background, and (c) the difficulty of texture segregation. For tri- 
angles, the correlations were not significant, perhaps because so few illu- 
sory conjunctions were seen. It looks, then, as if closure does function as 
a primitive feature in the same way as line orientation and color. It also 
looks as if the three tasks I used are inter-related in the *ay the theory 
suggests. This is certainly an example of boot- strapping Psychology, but, I 
would argue, not as such inadmissable. 

Finally, is there a personality trait of proneness to illusory conjunc- 
tions? If you make arrows from their parts, do you also tend to make triangles 
from theirs? The answer is 'yes 1 : The within-subject correlation of 0.5 was 
significant. Whether it would generalize to color-shape conjunctions is 
another question. 

That brings me to the end of the data I want to describe today. I will 
return for a last few words to the problems this account may raise for our 
everyday perception of objects and complex scenes, or of words, sentences 
and meanings in reading. Can we reconcile what I've been claiming with the 
apparent speed and richness of information processing that we constantly ex- 
perience? I can only speculate. Perhaps this richness at the level of 
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object: and scenes is largely something we create, an informed hallucination • 
We can certainly register a large array of features in parallel, and we can 
do this along a number of dimensions. But, if we apply more stringent tests 
to see how accurate and detailed we are in putting them together, e.g. - in 
perceiving faces or words, the results tend to be less impressive. Scanning 
a school photograph for my daughter's face, among hundreds of other teen- 
agers is a painstaking business. Proof-reading is also best done slowly and 
serially. In both these examples, contextual redundancy is less useful than 
it normally is. Much of pur peripheral or non-attentive 'seeing' may capi- 
talize on our prior knowledge. It may consist of matching expected features 
to actual features without checking on how they are combined. If so, the 
'wholes' or objects would still exist in our heads. They would be what we 
expect to see and normally end up seeing. Yet they may not be the initial 
code that registers the stimulus in its sensoiy form. I suggest there may be 
three ways in which we can see whole objects, and we may not be aware which 
we have used in any given instance. 1) We may see them, as I've suggested, 
by integrating their features in the spotlight of attention. 2) We may see 
them by predicting their features in a familiar context and separately con- 
firming that each feature is present. 3) Finally, in the absence of either 
prior information or focused attention, we may be reduced to random resyntheses 
which result in illusory conjunctions. 
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Consequences of Visual Orienting 1 
Michael I. Posner and Yoav Cohen 
University of Oregon 

I. Introduction 

During the past several years ve have been studying how peripheral 
visual events produce covert orienting of attention. Much of this work 
has now been published (Posner , 1980; Posner , Davidson & Snyder, 
I960; Remington, 1980; Shulaan, Remington & McLean, 1979), These studies show 
covert attention movements are sufficiently time-locked that we can trace their 
course in terms of changes in the efficiency of responding to probe stimulus 
events that occur at different places in the visual field. 

In this paper we ask: What are the consequences of having oriented to a 
peripheral cue once attention is returned to fixation? We have two basic reasons 
for asking this question. First is an intrinsic interest in the dynamics of what 
influences the likelihood of attending to a spatiil position. Do we tend to 
interrogate a previously nctive area again? Or do we tend to avoid it In favor 
of a fresh source of information? Repeated events are known to habituate, 
but repetition can also yield faster, more efficient response times. Second, 
work on orienting to letters by McLean & Shulman, 1978, has shown facilitation 
of the attended pathway — even after attention is withdrawn. Presumably, attending 
to a letter ennances the efficiency with which it can be activated by input even 
after attention is reoriented toward aether task. We wish to compare the results 
obtained with orienting to visual location with the findings based on orienting to 
higher level codes. 
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II. Basic Paradigm 

Our subjects look at a cathode ray tube display consisting of a central 
box which Is flanked by two peripheral boxes placed 8° to the left. and right of 
the fixation box. The trials begin by 150 mil Usee brightening of one of the 
two peripheral boxes selected at random. A bright probe dot occurs either 
0, 100, 200, 300 or 500 milliseconds following the brightening. The dot is 
usually in the center box (.6), but it may occur either side (.2). Catch trials 
in which no probe is presented occur with probability of .2. Subjects are 
instructed to respond to the dot as quickly as possible by pressing their 
single key. Eye movements are monitored "sing E0G electrodes and trials with 
detectable movements are excluded. The first study used six Ss run on two days. 

Our expectation was that the cued stimulus would su-mon attention. Thus, 
the cued side would have an initial reaction time advantage over the uncued side. 
However, because probes occur mainly at the center, subjects should try to keep 
attention there insofar as they can. Thus, the initial advantage to the cued 
side should be lost as full attention is given to the center. V7e can then 
compare the two sides to observe the consequences of the previous facilitation 
by the cuing. 

The results conform well to our expectation. These are shown in Figure 1. 
The cued side shows an advantage for the first 150 milliseconds. This is replaced 
by a clear inhibition after about 300 milliseconds. The center remains fastest 
throughout as might be expected both by its high probability and its foveal loca- 
tion. It looks as though t.ie consequence of the early advantage to the cued side 
is a subsequent inhibition. 



Insert Fig. 1 About Here 
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Several views about the origin of this inhibition occurred to us: 
1) It could be due to the fact that this is a comparison of two ax ter native 
positions. In many two-alternative reaction time tasks alternations turn out 
to be better than repetitions. Failing to find a probe at the cued position 
shortly after the cue perhaps the subject guesses that the probe is more likely 
to occur at the other position. 2) It could be due to the movement of attention 
away from the cued stimulus in order to return to the center. If the subject 
moves attention back toward the center 9 perhaps he has more difficulty in Leas- 
ing it back to the cued position than he does in allowing it to continue to the 
unc.Med side. 3) Some part of the pathway from the cued location is reduced in 
efficiency by tne cuing. This could occur because of the sensory cue itself, or 
because of the orienting that occurs as a result of the sensory cue. 

III. Four Alternative Experiment 

In order to test the first two ideas, we used a central box and four 
peripheral boxes > each 5° from the central box. The probe occurred at the 
central box with probability .6 and in one of the peripheral boxes with equal 
probability of .1. Otherwise, the experiment was the same as the previous one. 

Figure 2 shows the results for 12 subjects. Comparing the cued position 
with the mean of the other three positions there is an initial advantage for 
the cued side replaced by an inhibition as before. When the side opposite the 
cue (far position) is compared with the two orthogonal (near) positions, it is 
clear that the far position is no faster than the two near positions. These 
results show that the inhibition is not limited to the two alternative case 

Insert Fig. 2 Abouc Here 
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and that a stimulus position in the direction of the assumed attention movements 
from cue to center is not necessarily at an advantage over other positions in 
the visual field. The results thus eliminate the first two explanations for the 
inhibition effect. 

IV. Double Cuing 

In order to examine the role of sensory factors in this phenomenon we 
introduced cuing either by brightening the cued box or by dimming it. If the 
facilitation effect is indeed attentional and not due to forward brightaess 
enhancement, for example, we should get similar facilitation in both cases. To 
check further on the sensory versus attentional character of facilitation and 
inhibition we introduced trials in which both peripheral positions were cued 
simultaneously. To summon attention back to the center we brightened the 
center position 300 milliseconds following the initial cue. No probes were 
presented at the center but they occurred with equal probability at the two 
flanking boxes, either 80 milliseconds following the cue or 500 milliseconds 
following the cue. In ccordance with the previous result we expected the cue 
side to show facilitation in the former condition and inhibition in the latter. 

The results are shown in Figure 3. Data .rom the single cue trials conform 
well to our expectations. Regardless of whether the cue was introduced by 
brighten 'ng or dimming the cued side is initially faster than the uncued and is 
slower for probf s 500 milliseconds following the cuing. This suggests the 
facilitation effect is not due to nny kind of brightness enhancement. 

The results of the two cue condition aremore interesting. The cued sides 
are not significantly facilitated when compared to the uncued side in single 
cue trials. This suggests, in accordance with previous work, that attention 
cannot be split to the two sides when both are cued. However, the inhibition 
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in the two cue trials is as great as that found for single cue trials. This 
suggests that the inhibition effect dofcs not arise fro* attentional orienting 
but fro* the information presented at the cued position. 

V. Arrow Experiment 

To check further on the role of sensory information and attentional orienting 
in facilitation and inhibition we used a central cue rathec than a peripheral 
cue to indicate where to attend. Each trial began with an arrow that occurred 
above the central fixation box. The arrow indicated that the cued side would 
have a probe with probability .8, while the uncued side (opposite the arrowhead) 
would have probes with probability .2. 600 milliseconds after the arrow, 
attention was returned *o the center by brighteninr the center box. Probes follow- 
ing the center brightening are most likely to occur at the center (.6) and have 
probability .2 of occurring on either s<de. Probe events occur either at 450 
milliseconds after the arrow when attention should be on the side cued by the 
arrow, or at 950 or 1250 milliseconds following the arrow when attention should 
have been returned to the center by our logic. 

The results are shown in Figure 4. The cued side shows the expected 
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facilitation following the arrow cue. This is in accord with many other results 
obtained on central cues. However, there is clearly no inhibition following 
the return of attention to the center. Thus the inhibition effact, but not the 
facilitation effect depends upon presenting sensory information at th* periphery 

ERJC 25h 



VI. Conclusion 

Our most direct conclusion is that visual information both summons attention 
and serves to inhibit the processing of further information at that place in 
space. Under many circumstances the presence ol attention will compensate for 
the habituation caused by the cue, but attention seems to have to work against 
*• basic sensory bias that favors fresh information sources. This complex but 
exquisite reciprocal relationship between sensory information and attention 
presumably arises in the need. to be able to sustain information to a 

source of repeated signals and the need to be able to summon resources to 
fresh signal sources. It warns us that although any given experiment may be 
aimed at the study of sensory or attentional processes, careful control is 
needed to tease apart the contribution from each level. Many experiments, supposed 
to be purely sensory, must involve a net effect of both sensory and attentional 
processes. Now that we have methods of the control and time-locking of attention 
to aspects of the signal,, it may be possible to understand more fully the 
contribution of these two levels to the overall information processing. 

The results obtained with orienting to visual positions in space seem at 
first quite different from those that McLean & Shulman (1978) reported for 
attention to letters. When attention is given to a letter it remains active and 
thus is moru efficiently processed than any other letter once attention is 
withdrawn. We have never observed inhibition in the letter experiment. Perhaps 
at a deeper level similar reciprocal relations between purely sensory activation of 
a semantic pathway and attention will appear in further experiments. We have not 
yet been able to locate a model for the thorough examination of this possibility. 

We have begun to apply insights from the reciprocal relation between 
attention and sensory processes to two areas. First, we are trying to understand 
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how people are able to maintain concentration on a source of signals for an 
extended time. Put in its usual terms: How does filtering occur? Our results 
suggest that filters require an active orienting of attention to overcome the 
advantages that accrue to fresh sensory pathways. Second, we are tracing the 
effects of midbrain and cortical injury on these components of orienting. The 
ability to time-lock the operation of cognitive mechanisms of attention, as shown 
in our peripheral and central cuing studies, provides a rich methodology for 
exploring attention, even in patients who have difficulty understanding more 
complex instructions. We expect these methods to be useful in teasing apart 
the underlying neural systems that subserve aspects of visual orienting. 
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Figure Captions 

Figure 1 Reaction time to a probe dot as a function of cue to probe S0A for 
probes on the cued and uncued side and center. 

Figure 2 Reaction time to a probe dot as a function of cue to probe S0A for 
probes at the cued position, position opposite the cue (far) and 
two orthogonal (near) positions and the center. 

Figure 3 Reaction time to a probe dot following a cue consisting of brightening 
(left panel) or dimming (right panel) the surrounding as a function 
of cue to probe S0A (interval) for cued and uncued sides and trials 
on which both sides are cued (both). 

Figure 4 Reaction time to a probe dot as a function of S0A (interval) following 
a central arrow cue for oued, uncued and center positions. 
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UNITIZATION AND AOTQMATICITY U READING 
David L&Berge 
University of Minnesota 

To begin my remarks, I would like to describe the general model which 
is a traditional way of representing information processing and the means of 
its control. Usually every process we wish to represent we put inside of a 
box. For example, when you present a s 'mulus, you put into one box the sen- 
sory feature detection, and a message from this box is then sent on to another 
box containing the pattern processor, and a message from this box is sent on 
to a semantic processing box, etc. Every time we want to talk about some pro- 
cess, we put it in a box. This way of modeling stvtis from the work of Shannon 
& Weaver (1949) and was developed further by Broadbent (1958) . 

Processing Within vs. Between Systems 

In 1975, I tried to draw attention to the processing that goes on within 
a box and to suggest that it contrasts with the kind of processing that goes 
or- between boxes. An example of wi thin-box processing would be unitization of 
a perceptual pattern. Within this box we have feature detecto which aie 
arranged in sotue hier&—:hical fashion leading to the arousal of activation in 
higher o^der perceptual codes, which are long- term memory structures. So, I 
agree with Buz that you cots* off the sensory surface into long-term manory 
right away. What it is thct becomes automatic wher. you see a familiar pattern 
is some kind of an organization of thi se lower structures into the higher struc- 
tures or codes. Lines representing organization of features into codes are not 
merely associations x,\ the way that lines between patterns and their names are 
associations. That underestimates the richness of the properties that are 
going to be required to account for the organization which occurs within the 
perceptual system. At present, researchers are not in agreement as to what 
relationship the components make with tne whol^ pattern, or even if there is 
such a thing as holistic processing. By the way, I should add *hat other pro- 
cesses such as semantic processes, are assumed by sorr<e Investigators to feed 
down into these perceptual levels to help select what aspect of the total pat- 
tern is to determine the response in a given task. So this model is not always 
regarded as a pure Lottom-up model. 
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Another kind of a between- systems processing that becomes automatic is 
the naming of patterns. Another example of within-system automaticity would 
be processing t) rough of knowledge structures . Typically, lines within the 
semantic system are called links, which carry a good deal more than the pro- 
perties of association which we've inherited from the British empiricists, 
namely facilitation and inhibition, and from neurological research, namely 
facilitation and inhibition. Another example of within-system processing *hich 
becomes automatic is motor control. Here, practicing a skill as if it were a 
sequence of associated responses may well be less effective than if it were 
practiced as a search for new ways to organize responses. If I may speculate 
for a moment, it seems clear to me that when I play c*.i arpeggio on a piano, 
that I'm not going to get better by repeatedly articulating individual small 
units, but rather by exploring ways to position the hand as I group the notes. 

We are now wc *ing with handshapes on keyboards to try to determine whe- 
ther souie of these organizational properties which we are borrowing from the 
work in perception can be generalized to the work in motor control. 

Now, to contrast the organization-based automaticity within a system (or 
box) to the association-based automaticity between systems, we may consider the 
Shiffrin and Schneider (1977) theory, whereby latters or digits or other pat- 
terns become mapped onto some category. The category, I assume, can be repre- 
sented as a semantic node, but I don't want tc infer that this is a simple 
node, since it typically represents the complexities * meaning. 

In summary, when automatic processing is within a box, it appears to obay 
different principles than when it occurs between ooxes. However, I do not want 
to actempt to defend the assumption that these two categc ies of automaticity 
are the only ones. It is likely that people in this room can think of a third 
or fourt type of category. 

One of tne important consequences of separating two general modes of pro- 
cessing in this way is that the acquisition of automaticity may well be dif- 
ferent for each case. I think that associating a visual code to its name is 
* matter ± sheer practice, it's not a question of looking at the stimulus in 
different ways, or lookinq at the r»jponse differently. You simply e ercise 
it. As Shiffrin said yesterday, exercising the mappings of visual things onto 



category responses produces automat i city. For example, training a subject to 
press the right button for a letter and the left button for a digit will show 
fast learning among college students , because of the kind of category learning 
that has already taken place before they enter the laboratory. On the other 
hand, the training of a person to p oduce a word unit, to consider those five 
letters now as one word, is not an exercise of qimply looking at those letters 
again and again "n some sequence. What probably happens is that the person 
begins to widen the integration extent for a unit as large as a word as opposed 
to the size of the integration for letters. 

The learning of a larger motor unit, as I sa : d before, is not a matter 
of repetition of finger sequences. Rather, I think you explore new hand shape 
feelings as you learn. As the Russian physiologist Bernstein (1967) said, 
practicing a skill :'.s r~ a matter of repeating old solutions; it is a matter 
of problem solving. Practice session by practice session, what you're really 
looking for is ways that it can be done efficiently, ways of organizing such 
that you can give just one thought to many responses. In other words, you try 
to get the right kind of feel that's going to allow you to chunk as ma^y little 
units as you can. In practicing a skill, you should not simply attempt to 
repeat old things, out \ ou should attempt to look at what you are doing. 



Unitization 



I turn now to the problem of measuring unitization, a process which is 
assvmed to take place within a system. We have tried to measure unitization, 
and to get an idea of what people are doing when they unitize. When you dis- 
play a word, it is not a trivial matter to determine whether or not a person 
is looking at individual letters or the whole word. This problem is made more 
difficult because subject. > ^ave the: capability of perceiving a word either way 
and, in fact, probably often do it either way, within the course of a daily 
college set of classes. One indicator of component vs. holistic word process- 
ing was described by Terry, Samuels and myself (1976). We presented words of 
different lengths, and we assumed that if a person were looking at a word in 
terms of components, whether it be a letter or spelling pattern, that the 
latency of classifying a word would increase with the number of components. 
We used words of lengths three to six and we asked subj^j^s to make a go/no-go 
response on the basis of whether the words were names of animals or not. The 
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main independent variable was word length, and the dependent variable was num- 
ber of letter components. Another variable we considered a*: the same time was 
degradation of the letters. We conjectured that if subjects were looking 
at the component, then degrading it should affect its process-* ^g. We degraeed 
individual letters by erasing sections ol them, such that it took p. person a 
little longer to identify them, but with enough time they could urambj guously 
identify each letter. We predicted that if a subject perceives familiar words 
in terms of the components, then there should be an increase in latency with 
word length. Degradation of the letters should then produce an interaction. 
So we predicted not only a significant slope of latency with word length 
for a normal presentation condition, but with the degradation condition a 
steeper slope. What we found was that for college students there is abso- 
lutely no effect of word length on the latency to identify the category of 
the word, even when letters were degraded. 

We were not leally satisfied with this result because we had no indepen- 
dent evidence that degradation was sufficiently strong to produce a clear test 
of the role of the letter qualify. So, what we did was to induce the subjects 
to process component-wise, by presenting these same words in mirror image. 
The results showed significant slopes for the transformed non-degraded words 
and for the transformed uegraded words, with the degraded condition showing 
the greater slope. So we tentatively concluded that the degradation was 
strong enough to affect letter processing when the subject was using letter 
components. 

With this indicator of component vs. holistic processing in hand, we 
turned to the question of acquisition of unitary processing. We asked he- 
ther beginning readers would show component processing in this task when col- 
lege students showed holistic processing. We took the task into the schools 
and tested subjects in second, fourth and sixth grades and compared them with 
colleqe students. The results of this developmental study showed a large 
word length slope for second graders, with the slope decreasing as grade 
increased, until the slope reached zero for the college subjects. We tenta- 
tively concluded that word processing in early grades was by components (let- 
ters and/or spelling patterns), but that by college age, the processing was 
holistic. 



Now, if we look at this thing called word length as a function of kind 
of task, we observe some interesting differences. So far we have described 
a categorization task in which we get a zero slope. However, in a physical 
matching task, one routinely gets word length effects with highly familiar 
words (Bruder, 1978; Eichelman, 1979). We have tried to eliminate the effect 
in the matching task by cuing words physically and semantically , and by using 
mismatching pairs that differ in all letters and positions but we still get 
a word length effect. From this kind of evidence, \e conlcuded that the match- 
ing task requires a different kind of processing than a categorization task. 
Perhaps matching has more than just one stage. Perhaps there is unitary pro- 
cessing early followed by a check of the individual letters. What contrasts 
the matching task with the categorization task is the possibility that the 
subjects might perform the categorization task without processing components 
at any stage at all. Subjects might short-circuit what the component process- 
ing aspect of perceiving that is necessary in the matching task. 

The lexical decision task (Butler & Hains, 19/9) seems to fall somewhere 
between the matching and categorization tasks in this analysis. Again, I have 
he?rd of no consistent results which eliminate word length. Becker, in a re- 
cent manuscript, consistently gets word length effects across several manipu- 
lations. Naming a word seems to produce a slope effect with the number of 
components in many cases. 

Now, taken together, the results which relate tasks to slope of the word 
length function seem to me to imply something about choice of training methods 
for word identification. Naming words as a training procedure does not trans- 
fer well to comprehension tests. Perhaps because you can sound out a word 
syllable by syllable, you stay at the component level. But for categorizing 
a word, you must unitize it before you can associate it to its proper meaning. 
You cannot get a meaning from letters, but only from the word. So, it would 
seem that asking a person to do a task that depends upon semantic processing 
of some sort can produce some kind of feedback which will encourage holistic 
processing. That would be one speculative conclusion relative to acquisition 
of unitization and eventual automaticity . 
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Automaticity 



Now I turn to a measure of automaticity. We have been using a Stroop- 
type of test called the Flanker test, to get an estimate of how the process- 
ing of words may proceed automatically. Eriksen and Eriksen, in 1974, per- 
formed an experiment in the following way. They assigned four letters to two 
different responses, a lever press right or left. The letters S and C were 
assigned to one response and the letters H and K to the other response. They 
presented a display with these letters, such that one of these letters was in 
the center and three of one of the other four letters were in the flanking posi- 
tions (e.g., CCCSCC and HHHSHHH) . Subjects were told to ignore the flanking 
letters. The warning signal was a fixation cross just below the middle letter. 
The main finding was that a display like CCCSCCC, which we call a compatible 
condition, is processed about 80 milliseconds f as' er than the displo. ~' u e 
HHHSHHH, which we call an incompatible condition. 

I regarded this effect as an indicator of automatic processing of the 
flanker letters, and Shaffer and I attempted to repeat the procedure using 
words instead of letters. We used two buttons and we placed the words one on 

PINE 

top of each in thj following way: IRON. We used only one flanker item in- 

PINE 

stead of three, because we found that the effect shows up with one flanker, 
and also Taylor found the effect using one flanker letter in a study which 
varied SOA in presenting the flankers. His results show a nice mapping of the 
flanker effect over time. 

In the experiment by Shaffer and myself (1979), the word items allowed us 
to evaluate a category effect as well as a response effect. We assigned words 
from furniture and tree categories to one hand, and words from metals and 
cloth* nq categories to the other hand. Thus, flanking words could come from 
not only a diluent category, but a different response relative to the tar- 
qet word. One other thing I might mention is that using categorization pro- 
vides a real convenience in experiments of this sort because one can use a lot 
of different words; because categories are assigned to responses, one does not 
have to pretrain SR assijnments or mappings of individual words. We put in 
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neutral flankers occasionally in order to evaluate whether our compatibility/ 
incompatibility effects were providing cost and/or benefits. 

The results showed s gnificant differences between neutral flanker condi- 
tions and compatible and incompatible flankers. In addition, it appeared 
that flankers of a different category but having the same response assignment , 
produced significantly slowe responses than cases in which the flanker and 
target words came from the same category. 

We can simplify the task just by using two categories , one assigned to 
each hand. We used words from *:he categories of body parts and animals. Our 
pilot work indicates that second graders show no compatible/incompatible effect 
but sixth graders do show a compatible/incompatible difference, implying that, 
like the word length effect for unitization, second graders show no evidence 
of the flanker effect for automaticity. Unitization and automaticity seem to 
go together. 

There is a problem in choosing the proper kind of neutral flanker to get 
an estimate of facilitation and interference in these flanker tasks. The pro- 
per choice of neutral flanker is not always an easy decision. You may devise 
a lot of different types, but not all of them fall in between the incompatible 
and compatible flankers. I'm not sure exactly why this variability occurs. 
But if any of you want to use the flanker task, and want to put in the neutral 
flankers in order to evaluate facilitation and inhibition, be careful in your 
choice of the kinds of neutral words. 

Now, recall that we considered how one might unitize a pattern. I pre- 
sune that what one does with word patterns is to look at more than just the 
letters, to order them, or consider them as a group. But also, one must con- 
sider a wider view and pick up perhaps whatever is happening inside the boun- 
daries of the word. I don't like to use the term "relation", because it sounds 
very unscientific unless you're a mathematician, and we thought that we 
couldn't figure out what it was subjects were looking at in between the word 
boundaries mainly because one can't point to relations. Consider the dif- 
ference between my two fingers: you can't point to it. You can point to my 
fingers, but not to the difference. As Bertrand Russell pointed out in his 
1912 book on philosophical problems, consider that Edinburgh is north of Londc 
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you can point to London and you can point to Edinburgh, but you never point 
to "north of". It simply doesn't exist, it's a Platonic universal, not some- 
thing that is real- So it's no wonder that we have trouble communicating with 
each othe- about those things called relationships. But perhaps we could 
tell when the person is looking at something that must be other than just the 
little things you can point to or put nouns to. 

The Attention Spotlight: An Attempt to Measure Its Size 

What we did was the following. We presented words like "horse", a word 
with an "R" in the center, and "salad", a word with an "L" in the middle. If 
that middle letter was an "R", the subject was to press the right button; if 
the middle letter was an "L", the subject was to press the left button. Occa- 
sionally we presented an arrow pointing left or right, at the center or at the 
end letter position. Instructions were: "if you see an arrow appear and it 
points to the right, press the right button, if it points to the left, press 
the left button." This is not a very difficult task. We felt that if the 
response time to the arrows was sensitive to the size of the attentional 
spotlight focusing on the middle letter, then there would be a longer latency 
at the outside position of the arrows thar at the center point. The other 
condition that we ran, in order to vary the spotlight ^ize, was to ask the 
subject to classify the word as either an animal or a body part. We reasoned 
that ii this case, if the person is looking at the word component-wise and 
reading from left to rignt, we should get a faster response at the left end 
position. On the other hand, if the word categorization condition induced 
subjects to focus on the word* as- who le , then the latency to the arrows should 
be constant across all three positions. 

The results are based on 20* arrow probe trials with 12 ubjects in each 
grou^. We found that the position of the arrow made a significant difference 
for the letter (R or L) condition but not for the word condition. In other 
words, the probe by task interaction was highly significant. Error rates were 
1% in the letter task, 9% in the word task. 

There is a problem with this procedure having to dc with stimulus response 
compatibility. If the arrow is presented on the left and points to the left, 
tfw response is fast. But if an arrow in the left position points to the right, 
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the response will be slower. We believed that it was very important to shift 
the response topography in such a way as to eliminate the compatibility ef- 
fect if at all possible. So we repeated the experiment using a lever which 
we pressed forward in a go/no-go design, which seems to eliminate the problem. 
In this case, we used "Bs" against "Rs" . For a "B" you press forward, and for 
an "I you make no response. Occasionally we presented arrows, not only 
center and ends of the five- letter words, but on the outside too, because we 
wanted to map out more positions around the word pattern. There were five 
subjects in each group. 

The same pattern of results appeared: a relatively flat curve for word 
judgments and a V-shaped curve for the letter judgments. Six of the subjects 
I was able to test across the two task conditions and thus get a better esti- 
mate of the relative height of the curves. The results showed that the center 
position gives about the same latency for the tv - corditions. 

There was one other experiment which used this arrow test. This experi- 
ment separated the letters because it was thought that having a space might 
affect letter detection. When letters are closely spaced there may be a sup- 
pression effect much like a flanker test showed. So we put a space between 
each letter and we got about the same effect that we got before. In fact, you 
can almost put the curves over each other. 

Here then is one way we're currently trying to understand what might be 
happening when a person looks at something as a unit, as opposed to looking at 
it as a component part. 
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Abstract 



Predictions of a class of two-state and a class of continuous state 
attention models are tested in an experiment requiring subjects to detect 
and locate targets. The model predictions are tested by comparing the 
effect of focused versus divided attention instructions on probability of 
correctly locating a target* The class of two-state models predict that 
target location performance will be better in the focused than in the 
divided attention condition while the continuous state models predict that 
the reverse will be true* Results from an experiment testing these predic- 
tions seem to favor the continuous state models* 
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Two State Versus Continuous State Models 

In this paper wc present an analysis of two-state versus continuous 
state attention models. Eacli class of models shares the assumption that at 
any given instant, the human observer has a limited amount of processing 
resources, capacity or attention. The classes differ, however, in assump- 
tions concerning the internal representation of the stimulus used in deciding 
whether in a visual search task a target is present. In the class of two- 
state attention models, it is assumed that the internal representation can 
be treated as a two-valued random variable, e.g., this might be interpreted 
as a stimulus code in which a categorical code for the stimulus is retained 
instead of more detailed sensory information. This assumption, of course, 
does not exclude the possibility that the internal representation of the same 
stimulus is better treated as a more than two valued random variable at other 
points in time or other stages of processing. In the class of continuous 
state attention models it is assumed that the internal representation used 
in decision-making is best treated as a continuous valued random variable, 
e.g., this might be interpreted as a stimulus code in which a great deal of 
the sensory information is retained. Our analysis of these classes of models 
draws from three sources: work in psychology on attention to multiple sources 
of information (Kinchla, 1974; Eriksen & Spencer, 1969; Cardner, 1973; 
Shiffrin & Gardner, 1972; and Shaw, 198J); work in psychophysics on two- 
state and continuous state models (c.f. Krantz, 1969); and work in formal 
search theory (Stone, 1975; Mela, 1961; Kadane, 1971; Tognetti, 1968; Koopman, 
1959) ) concerned with performance optimization in detecting and locating 
targets. The outcome of our analysis is the derivation of testable predictions 
discriminate between two-state and continuous state attention models. 
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Two State Versus Continuous State Models 

Organization of the Paper , We begin by presenting the class of search 
problems to which our attention models apply. This is followed by brief 
reviews of related literature in attention, psychophysics and search theory. 
Next, we give a description of the experimental paradigm we use and then 
present the formal models and their predictions, 

THE SEARCH PROBLEM 

Suppose there are n locations in space and the internal representation 

of information at each location can be treated as a random variable. Each 

random variable, x^, is one of two types: target (i=l) or nontarget (i-0) 

depending on whether a target or nontarget is present at location L^. The 

X^'s may be dependent or independent. The pattern of types of random 

variables over n locations is denoted by S. . . * _ . 

— 1 2 3* n where *k or 1 

depending on whether a target or nontarget is in L^. The probability of the 

pattern S • is denoted by II. . . We consider two possibla 

T2 M,l n V^'V 
search objectives: detection and whereabouts. In a detection search the 

objective is to maximize the probability 02* correctly deciding whether a 

target is present or absent in at least one location regardless of where it 

is. In a whereabouts search the objective is to maximize the probability 

of correctly deciding which location, if any, contains the target. In these 

search problems, it is assumed that there is a limited resource (attention, 

processing capacity, search effort, search time) available for processing. 

One type of optimization problem is how to allocate the limited resource 

among the n locations to maximize one of the objectives given above. In 

this paper, we show how the optimal allocation of resources during search 
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Two State Versus Ccntinuous State Models 

depends on the type of search and the information acquired during search. 
Furthermore, it is shown that the question of whether both optimization 
problems can be solved with the same allocation of resources depends upon 
assumptions about the information acquired during processing or search. 
More specifically, if the information acquired can be represented as a 
continuous valued variable (continuous state model) then simultaneous 
optimization of detection and whereabouts search is possible. If, however, 
the information acquired i« represented in terms of a two-valued variable 
(two-state model), then each optimization problem requires a different 
allocation of resources. Our strategy is to use this difference in whether 
detection and whereabouts can be simultaneously optimized to provide an 
empirical test between these classes of models* 

REVIEW OF RELATED WORK 

Attention and Set Size Effects , It has been observed many times that 
dividing attention among mere locations in space leads to a decrease in 
performance accuracy. Two possible explanations ot this affect are that the 
decrease in accuracy may be due only to the increased opportunities for 
making an error, and that accuracy may also be lowered because less atten- 
tion can be paid to each location. Set size effects in visual search have 
been found, for example, by Estes and Taylor (1964); Kinchla (1969, 1974(a) , 1974(b) 
Eriksen and Spencer ( 1969 ); Gardner ( 1973 ); Shiffrin and Gardner 
( 1972 ) f to mention a few. 

Recently, one of us (Shaw, Note 1) tested two specific attention models 
in several set size experiments. In these experiments, subjects were required 
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Two State Versus Continuous State Models 

to search through briefly flashed arrays of either two or four letters for 
a target letter. On each trial, the subject made two responses: one 
indicating "yes-no" whether a target was present or absent (detection 
search); and the other indicacing which location, if any, contained the 
target (whereabouts search)* In the case of two-letter arrays, subjects 
were instructed to attend equally to both letter positions and in four- 
letter arrays subjects were instructed to attend equally to the four letter 
positions* The two attention models tested were the Capacity Allocation 
Model and the Sample Size Model* The first is a two-state model and the 
second is a continuous state model* The two-state model is based upon the 
assumption that the conditional probability of detection of a target is 
the cumulative exponential function; the Sample Size Model is based upon the 
assumption that each internal random variable has a Gaussian probability 
function* In these models the amount of attention allocated to a location 
influences the parameters of these underlying probability distributions. The 
two non-attentional models assume that the internal random variables are 
continuous: one is based on the Gaussian probability function and the other 
on the exponential. These modelb do not a^ume that attention influences 
the parameters of the underlying probability functions and attention is not 
a parameter in the model. These models were compared to a boundary function 
for which it is assumed there are no processing resource limitations and no 
specific probability function is assumed* The predictions of the two 
attention models, wo non-attentional models and the boundary function are 
presented in Figure 1 together with data from several experiments reported 
in Shaw (Note 1). In the figure, theoretical probability of a correct 
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Two State Versus Continuous State Models 



Insert Figure 1 about hers 

location response for two locations is plotted against probability of a 
correct response for four locations for sevsral models - the two attentional 
and non-attentional models and the boundary function. Data from individual 
objects are indicated by the dots. The data clearly favor the attentional 
models, but the predictions of the Sample Size and Capacity Allocation Models 
are similar and so this data doe* not distinguish between them, flince both 
models give a good account of the data, yet uiffer considerably in their 
underlying assumptions, it is of interest to discriminate between them. 

Psychophysica . 

Two-State Threshold Models. Early investigators tested the sensi- 
tivity of sensory mechanisms ^n "threshold" experiments where the stimulus 
(target) was present on nearly all trials, . A few "catch trials" (target not 
present) were inserted to insure that subjects did not say "yes" just because 
they knew that the target was always present. Relatively few catch trials 
were used because it was believed that a subject's tendency to report the 
perception of the target was generally influenced only by whether it was 
above threshold, and not by possible biases in response tendencies. 

Classical high and low threshold models r.re based on the assumption 
that when a barely detectable stimulus is presented, the internal stimulus 
representation is in one of the two posoible states, detected or not detected. 
In a high threshold model a detect state never occurs when the target is 
absent. If the target is present, the subject will be in a detect state if 
the energy of the stimulus is sufficient to evoke an internal state that 
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Two State Versus Continuous State Models 

exceeds some fixed value called a threshold. If the internal state exceeds 
the threshold, the observer reports the presence of th* target; otherwise 
the response is M no. M However, observers do sometimes respond f, y esM when a 
target is absent. The presence of these false alarms led to the development 
of low threshold models in which a detect state may occur when the target is 
absent and ^ internal state exceeding the threshold does not necessarily 
lead to a "yes" response. 

Luce (1963) presented a low threshold model which accounted for ttre 
false alaras clearly present in the data but not predicted by the high 
threshold model. He proposed that each exposure to a given stimulus had a 
fixed probability of producing a given internal state in the observer. If 
a target was present the observer would be either in a detect state with 
probability £, or in a nondetect state with probability 1 - £. Similarly, 
when a nontarget was present the observer would be in a detect state with 
probability £ or a nondetect star* with probability 1 - £• Luce postulated 
two response-selection rules that generate "pairs of hit and false flarm 
rates: (a) report target present whenever in the detect state and for a 
fraction t, of the nondetect states (liberal strategy); or (b) report target 
absent whenever in a nondetect state and for a fraction ti of the detect 
states (conservative strategy). By varying either t_ or u the observer could 
trade off hits and false alarms. Luce shoved that the resulting ROC has 
two linear segments that meet where jt ■ 0 and u ■ 1. 

In this paper, the two-state model we present is a generalization of 
Luce's model from the single information source detection situation to 
situations where there is more than one source of information to attend to 
and there is a limitation on the total resource available for processing these 




Two State Versus Continuous State Models 
sources. It should be noted, also, that our interpretation of the two-state 
model is broader than the traditional one. We assume the model applies 
whenever the stimulus code involves categorization of a stimulus into one 
of two possible states for the purpose of making a decision. Tests of the 
model do not address the quercion of whether this is the only possible code. 

Continuous Models. In 1955, Swets, Tanner, and Birdsall 
questioned the assumption of a threshold that is independent of the subject's 
motivation. Following the Thurstonian tradition, they proposed that the set 
of possible internal states of an observer is best described in terms of a 
one-dimensional continuum of values. The internal representation of the 
stimulus is conceptualized as a random variable that can take on any value 
along this continuum. The probability that a stimulus evokes a value in a 
given interval along the continuum is specified by two overlapping probability 
distributions, one for the target stimulus (or signal) and one for the non- 
target stimulus (or noise). It is assumed that the subject selects some 
value along the continuum as the criterion, and classifies a stimulus as target 
or nontarget depending on whether the value evoked by it is above or below 
the criterion. This criterion, unlike the threshold, is under the subject's 
control and is influenced by motivational factors such as the relative 
frequency of targets and nontargets and the cost of making erroneous decisions. 
Errors arise because of the overlap of the target and nontarget probability 
distributions. A nontarget stimulus that evokes an internal representation 
abcve the criterion gives rise to an error traditionally called a false 
alarm; a target stimulus above the criterion leads to a correct response 
called a hit. Hit and false alarm rates vary depending on the observer's 
criterion. 
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Two State Versus Continuous State Models 

In the present paper, we consider an extension of this model to 
situations with more than one source of information in which the target 
might appear and there is a limited resource for processing these sources. 
V T o decision models have commonly been considered for the multiple informa- 
tion source detection paradigm (yes-no response); integration models and 
independent decisions models. In the former, it is assumed that the obser- 
vation period results in values for the n random variables and the ooserver 
constructs a decision variable, as a weighted linear combination of these 
values: 

y - Wl x u +w 2 x 21 ...+ w n x nl 

If the decision variable £ exceeds a criterion value, 8, the observer says 
"yes;" otherwise she says "no." Independent decisions models, in contra3t, 
assume a separate decision is made for each source before a final decision is 
made. Thus, for each source a decision variable, is constructed as 
follows: 

*k - 0 « x^ik 

The observer says "yes," target present, if Zy fc >c. That is, when c or more 
of the decision variables, indicate target present (y^ » 1). 

There is evidence that subjects can use both kinds of decision strate- 
gies. However, for the class of search paradigms we are considering, the 
evidence overwhelmingly favors the independent decisions model (Shaw, Note 1; 
Mulligan and Shaw, 1981; Swennson and Judy 1981; Starr, Metz, Lusted, 
Goodenough, 1975). We test this model again and use it in our data analysis. 
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Optimization results for detection versus whereabouts search first 
appear in formal search theory (Koopman, 1956; Mela, 1961; Tognetti, 1968; 
Kadane, 1971; Stone, 1975). These investigations assume that search effort 
is discrete ; that the target is always present in one of N locations; and 
that falbg detections of a target never occur . Resource allocation func- 
tions maximizing the probability of detecting a target under the Above 
assumptions have been studied and reviewed by Kadane (1971) and Stone (1975). 
Here we illustrate the optimal allocation for detect 4 on search. The reader 
should consult Stone (1975) for a review of work on whereabouts search. 

Suppose thare is gold coin lost in one of N boxes. For each box, the 
prior probability for the coin is P(i). Let be the probability of 
drawing the target in a single draw, given it is in Box j. Since draws are 
made with replacement, the probability of failing to draw the gold coin in 
the first k draws from Box j is 

6(j,k) - a ± (1 ~0L}) k ~ l 

Suppose also that a fixed number of draws K are permitted. We wish to allocate 
draws among the boxes in order to maximize the probability of drawing the 
gold coin given allocation ♦; that *s to maximize: 

P[D] - Z p(j)[l -a J ) ,,,(J) ], 

Note that 

p (j) 6(j,k) is the discrete counterpart of the rate of marginal return. The 
optimal allocation places the next draw in the box with the highest rate of 
marginal return. In other words, the optimal allocation of draws is to make 
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the n th draw in cell i such that 

P(i)a t (l^ i ) « max P(J)^(l-a j ) r(J,n " 1 » <,)) 

where r(j,n-l,<fr) is the number of draws out of the. first n that are placed 
in Box j by allocation plan <t>* 

Whereabouts search was first considered by Mela in the context of 
military search problems, Kadane (1969) and Stone (1975) have presented 
general results for thiq kind of search under the assumptions given above. 
Here we illustrate the optimal allocation for whereabouts search. 
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Now suppose that instead of trying to maximize the probability of 
drawing the gold coin in one of the K draws (detection search) we wish 
instead tc maximize the probability of correctly specifying the box containing 
the target* Thus, if after K draws the gold coin is net found, we are 
allowed to guess which box contains the coin* In this case, how should the 
K dravs be distributed among the boxes? The probability of correctly locating 

i 

the target (correct whereabouta) is 

P(CW) » p(i)[l-p(i)b(i,<j>(i))] + Ep(j)b(j f <t> (J)) 

- P(i) + r P (i)b(j, ♦ (!» 

where Box i is the box that is guessed when the target is not detected or 
drawn* The optimal allocation of draws to maximize P(cw) is to choose Box 
i such that P(i) ■ max P(j). Furthermore, no draws are allocated to Box i 
and the K draws are allocated among the remaining N-l Boxes so as to maximize 
the probability of drawing the coin if it is in one of these 1ML boxes. The 
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reason for allocating no draws co box i is easily seen from the equation 
for *(CW): if the coiti is in Box i, then it will never be found in one of 
the other N-l boxes and Box i uill be chosen. In this case, the gold coin 
will always be correctly located ai d so there is no point in wasting draws 
in Box i. 

EXPERIMENTAL PARADIGM 

The subject's task is to search an array of letters for a target- letter. 
Arrays contain either two or four letters positioned at the corners of an 
imaginary square in the case of four letters and at the opposite ends of a 
diagonal in the case of two letters. There are four possible S^ patterns 
for two letter displays and 16 possible 8 n patterns in the case of four 
letter displays. 

There are two dependent variables. First, the probability the subject 

says "no," target absent for the detection search given pattern S (S ) 

i j i j kl 

was presented. When displays contain two letters, the subject chooses either 
an upper or lower diagonal position for the whereabouts response. But, when 
the display contains four letters the subjects chooses either the positive or 
negative diagonal for the whereabouts response. On each trial the subject 
must make both a detection response and a whereabouts response, even when 
the detection response is "no." 

There are two kinds of attention instructions for the four<-letter 
displays: focused and divided attention. 

Focused Attention . Here the subject is told to attend to the 
positive (negative) diagcnt.l and ignore the other diagonal. For the detec- 
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tion response, the subject is instructed to say •yes" or "no" only to the 
information in the attended diagonal. For the whereabouts response, however, 
the subject is told to guess the unattended diagonal if the target is not 
detected in the attended diagonal. 

Divided Attention . Here the subject ir told to attend equally to 
all four locations. For the detection and the whereabouts response the 
subject is instructed to use information from all locations in making a choice 

THE FORMAL MODELS 

Model assumptions can be divided into two groups: representation 
assumptions and decision-making assumDtions. The former are concerned with 
the stimulus code (two-state versus contin-ous) used in decision-making 
and how attention affects this code. The decision-making assumptions are 
concerned with how a response is selected given the pattern of stimulus 
codes. 

A Class of Two State Models 

Representation Assumption s. First, attention is assumed to be a 
finite quantity, *. We denote by <f> k , the amount of attention allocated to 
L^. It is assumed that <J> may be partitioned among locations and this <loes 
not change the total amount: 

N 

* - r ♦ k . 

k-l 

This assumption was tested by Shaw and Shaw (-977). 

The third assumption concerns tne probability of a detection state for 
location 1^ given the target is there (X^ - 1) and <J> k has been allocated 
there: M<t> k ). It is assumed that b(<J> k ) is continuous, concave and increasing 
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with <|> k . Naturally, 1 - b($ k ) is the probability of missing the target in 
location L fc . The Capaci ty Allocation Model makes the specific assumption 
that b(<J> k ) - l-e~*k. The curve plotted in Figure 1 is based on this model. 

The fourth assumption concerns the probability of a nondetection state 
for location 1^ given a nontarget is there (X^ - 0) and * fc has been 
allocated there: q(<t> k >. In the Capacity Allocation Model it was assumed 
q(^) ■ 1» that is, false alarms are assumed not to occur. More generally, 
we assume that q(^) is continuous and increasing. The probability of a false 
alarm in 1^ (X^ - 1) is written as 1 - q(* k ). 

Decision-Making Assumptions . Rather than describe a very general 
model, we specify a model for each of three experimental conditions: two 
letter displays, four letter displays with focused attention, and four letter 
displays with divided attention; and each response; detection and whereabouts. 
Furthermore, these response models assume that the target is equally likely 
to appear in each location or diagonal. A more general model is presented 
in Appendix 1. 

Two Letter Displays - Detection Response . It is assumed that 
the subject may adopt either a liberal or a conservative strategy. If the 
subject adopts a liberal criterion, the response ?, yes M ^s given if a detect 
state occurs in either or both locations. In addition a "yes 1 ' response is 
assumed to occur with probability u when neither location results in a 
detect state. If the subject adopts a conservative criterion, then a 
detect state in one or more locations is assumed to produce the response "yes" 
with probability t, and "no" with probability l-£. The response is always 
"no 11 if neither location results in a detect state. 
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Two-Letter Displays - Where a bouts Response * If only one 
location results in a detect response then that location is selected for the 
whereabouts response. If both locations or neither location results in a 
detect state, then one of the locations is chosen at random. 

Four-Letter Displays - Focused Attention , Since S is 
attending to only two locations the d etection response model is the same as 
for the two-letter displays. If either location in the attended diagonal 
results in a detect state then the attended diagonal is chosen for the 
whereabouts response ; otherwise, the attended diagonal is selected. 

Four-Letter Displays - Divided Attention . For the detection 
response the subject may adopt a liberal or conservative strategy. In an 
extreme liberal strategy the subject may say "yes" to some fraction of the 
trials where there are no detect states, while in an extreme conservative 
strategy, the subject may say "no" to some fraction of trials where there are 
four detect states. 

For the whereabouts response the subject is assumed to choose 
the diagonal with the greatest number of detect states or to pick randomly 
when the diagonals^ have the same number of detect states. 

Two-Sta^e Model Predictions . Let P(CW/FA, S^) and P(CW/DA t S^) 
be the probability of a correct whereabouts response in the focused attention 
and divided attention conditions, respectively. Similarly, we denote these 
probabilities for the detection response P(CD/FA, S^) and P(CD/DA, S^). 

Predictions for Whereabouts Response . The two-state attention 
model predicts that the probability of a correct whereabouts response will be 
greater in the focused attention condition than the divided attention 
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condition. This is analogous to th* theorem we described from search 
theory. However, the present model assumptions differ somewhat from the 
assumptions made in formal search theory. First, we assume that it is 
possible for a detect state to occur when a nontarget is in a location. 
Second, we ssume search effort or capacity is a continuous quantity. 
Previous investigations of the whereabouts problems in search theory ha*»e 
assumed that a detect state can only occur when the target is present and 
that search effort is discrete. In Appendix 1 we show that for the case of two 
locations and a uniform prior that on the average, the probability of a 
correct whereabouts response will be greater for the focused attention 
condition than for the divided attention condition. That is, 



If one does not assume a uniform prior, then whether focused or divided 
attention is the superior strategy depends in a somewhau complicated way 
upon the relation betw ?n b(<J>) and q($). 

The inequality in (1) is meaningful when only one target is present 
in the four-letter displays or when two targets are present in the same 
diagonal. If a target appears in both diagonals, then the subject is 
correct no matter what the choice; conversely, if no targets are present 
then no choice of location can be correct. 

Whether the prediction in the above inequality can be observed depends 
on the size of the difference between these two probabilities and the 
precision of the data collected. We computed the expected difference In 
P(W/FA) and P(CW/DA) for the Capacity Allocation Model as a function of 
the difference between the false alarm rate when attention is focused versus 



P(CW/FA) > P(CW/DA). 



(1) 
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when attention is divided. This difference [p(CW/FA) -P(CW/DA)] is plotted 
in Figure 2 for two different values o* The predicted difference in the 
figure ranges from .05 to .45. Thus, it appears that one can ensure that 
the expected difference for this model can be made sufficiently large by 
using display energy conditions where the estimated $ corresponds to 
predictions of P(CW/FA) -P(CW/DA) > .10, 

Predictions for the Det ect ion Response , The value of the 
inequality (1) depends completely on whether subjects can and do follow the 
instructions to attend to a single diagonal in tue focused attention condi- 
tions. To check this we compare detection performance in the focused attention 
condition to detection performance in the two-letter array condition. Since 
in both cases the subject need only process two locations, we should find 
that detection performance is the same in both cases: 

,P(CD/FA, two diagonals) - P(CD/DA, one diagonal), (2) 
A Class of Continuous State Attention Models 

Representation Assumptions . First, attention is assumed to be a 

finite quantity, *. Second, if <fr fc is the amount allocated to it is 



assume^ that 



N 

r< 

k-l 



That is, 4iv 4 diag attention does not result in any change in the total amount* 
Let f fc (X) and f d (X) denote the probability density function of the random 
variables and associated with when target present and absent , 
respectively. It is assumed that f fc and are symmetric, unlmodal dis- 
tributions and have the same functional form* For convenience we assume that 
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the mean of the target random variable is greater than the mean of the 

nontarget random variable. Attention is assumed to influence the variability 

(noise level) of the internal representation. Specifically, we assume 

2 a 2 2 
°\± m where a is constant representing the variance inherent in the 

stimulus re-> esentation not influenced by attention. Evidence that this last 
assumption is consistent with data from set size studies is presented in 
Figure 1. An attention model very similar to this model has b<*en studied 
by Luce and Green (c.f. Green and Luce, 1973; Luce, 1977) in auditory 
detection and magnitude estimation paradigms, 
Decision-Making Assumptions , 

Two-Letter Displays - Detection Response . In the same 
experimental paradigm we use here, Shaw (Note 1) found that the detection 
responses of subjects were consistent with the independent decisions model 
and not with the integration model. This model assumes that the probability 
of a "no" response given is 

Kno/Sy) - P(X 1± < tyPCXj <3 2 ). 

Two-Letter Displays - Whereabouts Response . Most models 
of the continuous variety assume that when forced to choose between n 
random variables the subject chooses the response whose associated random 
variable is the maximum (minimum) of the n. We likewise make this assump- 
tion and so the probability location L. is chosen given S is given by 

ij 

P(no/S ±j ) - - max (X^, X^)) 
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Four Letter Displays - Focused Attention . Since the subject is 
attending to only two locations, the decision model for the detection response is the 
same as for the two letter displays. For the whereabouts response , the 
aubject is assumed to choose the attended diagonal whenever one of the 
associated random variables exceeds its criterion; otherwise the subject 
chooses the unattended diagonal. 

Four Letter Displays - Divided Attention. For the detection 
res P° nse the 8ub Ject is assumed to say "yes" whenever at least one random 
variable exceeds its criterion; otherwise the subject says "no." For the 
whereabouts response we let r - max (X u , X^) denote the maximum of the 

random variable associated with the positive diagonal and s - max (X , X ) 

31c A 1 

denote the same qaantity for the negative diagonal. It is assumed that the 
subject chooses the positive diagonal if r - max (r,s) and the negative 
diagonal if s - max (r,s). 

Continuous State Model Predictions . 

Whereabouts Response. In contrast with the two-state model, 
the continuous state models make the prediction that the same allocation 
of attention optimizes both whereabouts probability ;orrect and detection 
probability correct (see Appendix 1). This predicts that subjects should 
perform no better or more poorly in whereabouts when they are in the focused 
attention conditions (ignoring a location to be guessed). Thus, we have 

P(CW/DA, S^) > P(CW/FA, S^). (3) 

Detection Response . Again, the value of the inequality (3) 
depends completely on whether subjects can and do follow the instructions to 
attend to a single diagonal in the focused attention conditions. To check 
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this assumption we compare detection performance in the focused attention 
condition to that in the two-letter array condition. Since in both cases 
the subject need only process two locations, we should find that detection 
performance is the same in both cases: 

P(CD/FA two diagonals) - P(CD/DA one diagonal) (A) 

The Continuous State Model and its predictions are presented in more 
detail in Appendix 1. 

SUMMARY 

Two classes of attention models make contrasting predictions for the 
probability of a correct whereabouts response for the different attention 
xnstructions we use. The two-state models predict that the whereabouts 
response will be superior in the focused attention condition whereas the 
continuous state models predict that performance will be no oetter or 
worse in the focused attention condition when compared to the divided 
attention condition. Data from the detection response is simply used to 
confirm the assumption that subjects dn indeed ignore the unattended 
diagonal when so instructed. 

For our experiment we choose the two-location case where targets are 
equally likely in all locations and locations are independent. Though the 
theorems we draw upon apply to the dependent case, this does not cause a 
problem for us since we tell our subjects the optimal allocation strategy 
for the dependent case <p(l) - l-p(2)) and then analyze probability of a 
correct whereabouts only on those trials on which the target is in only 
one location or the other. 
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METHOD 

Subjects . Four adults, three females and one male, having normal or 
corrected to normal vision, served as observers. They were paid $3.00 pet 
hour. 

Stimuli . The stimulus displays were similar to those used previously 
by Shaw (Experiment A). Both two-location and four-location dis- 

plays were used. The stimulus at each location consists of a single letter 
flanked by dollar signs. In the two-location display, the three-character 
strings appear at che ends of an imaginary diagonal, separated by 1.25 degrees 
of visual angle. In half of these displays the characters appeared at the 
opposite ends of the positive diagonal (at the lower-left and upper-right 
of the screen); in the other half , they appeared at opposite ends of the 
negative diagonal (upper-left, lower-right). Four-location displays contained 
character strings at all four positions described by the ends of the two inter- 
secting diagonals. 

Varied mapping (Schneider & Schiffrin, 1978) was employed, i.e., the 
target letter changed from trial to trial. Both targets and nontargets were 
randomly selected on each trial from the following group: F, H, J, K, L, M, 
N, T, V, W, X, Y, and Z. Each letter vas used equally often as target and 
nontarget and each appeared equally often in each position. 

The a priori stimulus probability distribution for th£ two single diagonal 
conditions was as follows: I^q) - -AO, ?(S Q1 ) - .25, P(i 1Q ) - -25, and 
P(S n ) - .10. 

Stimuli in the four-location displays are denoted by jLjjj^t where i, 
and ± denote the event in the upper and lower positions of the positive diagonal, 
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and k and 1 the events in the upper and lower positions of the negative 
diagonal, respectively. The stimulus probabilities approximate inde- 
pendence and in this condition were: PCS^^)- .40, KS^^) - .13, 

^oio> - - 13 ' ^ ( s 0 ioo ) - - 13 ' ^iooo ) - - 13 > I(§oioi> " - 02 ' 
^no ) " * 02 ' " - 02 ' and ^ioio> - 

Thus, the probability of a target on only the negative diagonal was ,26, 
on only the positive diagonal, .26, and on both diagonals, .12. Note 
that two targets never appeared on the same diagonal and stimulus patterns 
with 3 and 4 targets did not appear. They occur with such small frequency in 
the independent case that we did not include them. 

Procedure . Subjects viewed the displays binocular ly through a 
Tektronix viewing hood mounted 87 cm from the front surface of a Tektronics 
Model 610 cathode ray tube. The CRT was under the control of a DEC PDP-12 
computer, using its standard letter format* 

Eac4 trial began with a 2-sec presentation of the lecter chosen as 
target, followed by a 3-sec fixation period, and then the 10-msec stimulus 
display. The fixation patterns consisted of two or four dots corresponding 
to the display locations, and an additional dot at the center of the display, 
i.e., at the point of intersection of the two diagonals. The subjects were 
instructed to prepare for each display presentation by fixating on the center 
dot. The importance of maintaining consistent fixation on all trials in 
all session types was stressel. Display presentation was followed by a 2-sec 
period in which a ,l Yes-No H defection response was made, a 2-sec period in 
which a location Judgment w*s made, and a 2-sec feedback period during which 
the target letter and stimulus pattern were redisplayed, 
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On two-location trials, the location judgment required a forced choice 
between the two stimulus locations. On four-location trials, the observer's 
task was to specify in which diagonal — positive or negative — the target 
had appeared* 

In both conditions, subjects were required 
to make a location judgment regardless of tha outcome of their detection 
judgment. 

Stimulus displays were presented in three types of sessions, corres- 
ponding to the three attentional strategies subjects were instructed to 
follow. In a divided attention session (DA) subjects were instructed to 
divide their attention equally and simultaneously between all display loca- 
tions, so as to maximize correct detection responses. In a focused attention 
positive diagonal (FAP) session, subjects were required to attend only to the 
positive diagonal and to ignore the other two locations. They were told to 
base their Yes-No response solely on the attended diagonal. For the loca- 
tion question, they were instructed to choose the positive diagonal if their 
detection response was positive, and otherwise, guess the negative diagonal. 
Similar instructions were given for the focused attention on negative diagonal 
sessions (FAN) except subjects were to base their responses on information 
from the negative diagonal. 

Each DA session consisted of 20 unrecorded practice trials followed by 
three blocks of 152 trials — a positive diagonal two-location block, a 
negative diagonal two-location block, and a four-location block. Order of 
blocks within sessions was counterbalanced. Subjects received ten-uinute 
breaks between blocks of trials. 
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Each FA session was comprised of 50 unrecorded two-location trials, 
followed by two blocks of 152 four-location trials. Subjects followed the 
same instructions (FAP or FAN) for both blocks within a session. The two- 
location trials, always presented first, were given to help the subjects 
focus their attention on the appropriate diagonal for that session. Subjects 
reported this to be a useful procedure in helping them to follow whereabouts 
instructions. As an additional aid, the fixation pattern on four -location 
trials indicated only the diagonal to which the subject should attend. 

Data was collected in 17 sessions (9DA, 4 FAP, and 4 FAN), with where- 
abouts sessions alternated with detection sessions. In the 9 DA sessions a 
total of 2736 two-location trials (1368 for each diagonal) and 1368 four- 
location trials were presented. In the 8 focused attention sessions, 1216 
FAP and 1216 FAN trials were presented. 

Prior to data collection all subjects had participated in eleven practice 
sessions — 6 DA, 2 FAP, and 2 FAN — a minimum of 608 trials per condition. 
During the first two sessions, both of which were DA sessions, display dura- 
tions were gradually decreased from 100 to 10 msec. Thereafter, detection 
and whereabouts sessions were alternated. During these practice sessions, 
display brightness was differentially adjusted for each subject to achieve 
the desired accuracy levels. 
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RESULTS 

Response probabilities (sample sizes in parentheses) are presented in 
Table 1 for all subjects and experimental conditions. Standard deviacions for 
these estimated probabilities ranged from .03 to .009 depending on the value 
of the estimate and the number of observations* 



Insert Table 1 About Here 



Set Size Effects . In Figure 3 the "set size effect" in this experiment 
is graphed as the probability of a correct whereabouts response for the one 
diagonal condition (averaged over positive and negative diagonal data) versus 



Insert Figure 3 About Here 

this probability for the two diagonal, divided attention condition. For each 
subject, the data are consistent with the two attention models and are well 
below the boundary function. The theoretical functions in Figure 2 differ 
from those found in Figure 1 because in our experiment subjects must choose 
between diagonals in the four location condition. lor this reason, the 
boundary curve was computed using a result from Green and Weber (1980) and 
used in Shaw (Note 1) in analyzing data from a similar experiment. The 
equation used Is 

P « ■ 1 4 f 1 - P 2 + 3<l-P 2 ) ] • 
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where P^ is the probability of a correct whereabouts response given four 
locations and P 2 is the probability of a correct whereabouts response given 
two locations. Each estimated probability on the x - axis is based on about 
1000 observations and those on the y- axis each is oased on about 720 
observations. These estimates were obtained by pooling data over occurrence 
of the target in the different possible positions. 

Independent Decisions . The "yes-no* 1 data from the two location (one 
diagonal) conditions were tested against the prediction that 

P(No | S ±i ) - P(X 11 <3 1 ) P(X 2j < e 2 ); 
using a statistic derived in Shaw (1980). This test uses the fact that 

lnP(No| S^) - lnP(X u + lnP(X 2j <& 2 ). 

The standard normal z-score*: of this test are presented in Table 2. The data 
of three of the four subjects is consistent with the independent 
decisions model. These results are also consistent with the assumptions that 
1) subjects divided attention equally between the locations; and 2) the 
chance of a detect state given a target (b(<J>)) is the same as the chance of 
a nondetect state given a nontarget (q ($)). Evidence favoring this assumption 
is important for the analysis of the whereabouts response data testing the 
predictions of the two classes of models. 

Probability Of A Correct Whereabouts Response . In Figure U the proba- 
bility of a correct whereabouts response for the two diagonal divided attention 
condition is plotted versus this probability for the focused attention condition. 



Insert Figure 4 About Hers 
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Recall that the class of continuous state mouels predicts that P(CW | DA) >^ 
P(Cw | FA); this means that points in the graph should lie on or below and 
to tY - right of the diagonal. The diagonal line represents the predictions 
for the continuous state model and the choice of the optimal criteria. On 
the other hand, the class of two-state models predict that P^CW | DA)< P(CW | FA); 
this means data points should lie above and to the left of the diagonal line. 
The predicted relationship between P(CW/DA) and P(CW/FA) for the Capacity 
Allocation Model and the optimal decision rule are represented by the dashed 
line. These predictions were based on the assumption that the conditional 
probability of a correct detect state is the 3ame as for a correct nondetect 
state and this probability is given by the cumulative exponential function: 
1 - e ^. If the probability of a nondetect state given a distractor is 
'.mailer than the probability of a detect state given a target then this 
dashed line will move closer to the diagonal. In fact, if the chance of a 
nondetect state given a distractor is very small then the dashed line will 
fall well below the diagonal. This point is discussed in Appendix 1 under 
two state models c The "yes-no" data presented in Table 1 are consistent 
with the assumption that the probability of a nondetect state given a dis- 
tractor (q (4>)) is greater than or equal to the probability of a detect state 
given a target (b(4>)). (See Table 2). 

Tn summary, it seems that the prediction of the class of continuous 
state models is supported by this analysis of the data. 

Focused Versus Divided Attention . The validity of conclusions reached 
from the data in Figure 4 depends upon whether subjects indeed focus attention 
on a single diagonal and ignore the information in the unattended diagonal. 
We now consider for oach class of models the consequences of a failure of 



ERLC 



312 

299 



Two State Versus Continuous State Models 

subjects to focus attention when instructed to do so. Failure to focus 
attention may result in dividing attention either equally or unequally 
between diagonals and then using the same decision strategy as in the divided 
attention condition. For the class of continuous state models this will 
result in points that will lie on or below the diagonal (on the diagonal if 
attention is divided equally an d the optimal criteria is used). The same 
implication holds for the class of two state models. Thus, failure of 
subjects to follow this instruction will probably result in data below the 
diagonal and consistent with the predictions of the class of continuous state 
models. W3 can test the following consequences of focusing attention and 
ignoring the unattended diagonal for the class of two state models. These 
predictions do not necessarily hold for the class of continuous state models 
because subjects may *hift criteria from condition to condition. However, since 
it is only the class of two state models that are of concern in this pre- 
diction, this does not affect the generality of our conclusions. First, it 
is predicted that .he probability of a no response given only distractors in 
a diagonal should be the same whether that diagonal is the only one present 
or simply the Pttended diagonal. This should be true independent of whether 
a target is present or absent in the unattended diagonal (Tests 1 and 2 in 
Table 2). Second, the probability of a "no" response given one target present 
should be the same whether that diagonal is the only one present or simply the 
attended diagonal (Test 3). 



Insert Table 3 About Here 
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In Table 3 we present the data relevant to these predictions and the 
results of testing them. For these tests the estimated probabilities were 
arrived at by pooling over positive and negative diagonal data. Inspection 
of the table revra. : that only the data of subjects 2 and 4 favored the 
conclusion that they ignored the information in the unattended diagonal when 
they were instructed to do so. 

DISCUSSION 

There are four important results to be summarized from our data. First, 
we found the effect of set size under divided attention conditions is con- 
sistent with attention models as opposed to nonattentional models (see 
Figure 3); the data is also consistent with previous set size findings (see 
Figure 1). In addition, a good account of the set size effects in ours and 
other data is given by two specific attention models; the Capacity Alloca- 
tion Model and the Sample Size Model. The former is a member of the class 
of two-state models we have presented while the latter is a member of the 
class of continuous state models. Second, an analysis of the one diagonal 
detection data indicated that 1) subjects appear to use an independent 
decisions rule; and 2) that if the two-state model holds, then our data are 
consistent with the assumption that the probability of a detect state given 
a target is the same as the probability of a nondetect state given a non- 
target [(b(<J>) ■ q(4>)]. Third, an analysis of detection data for four- 
location divided artentim versus focused attention revealed that only two of 
our four subjects seemed able to effectively ignore information in the 
diagonal to be ignored. Finally, the data of these two subjects were con- 
sistent with the prediction of the class of continuous state models rather 
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han the class of two-state models. 

Our predictions discriminating between two-state and continuous state 
attention models may be applied to information processing situations other 
than visual search. For example, consider date tion and location of an 
auditory signal; detection and identification of pitches; and detection and 
identification of spatial frequencies. However, for these results to *pply 
the situation must involve a limitation on processing resources and control over 
the allocation of this resource. In our experiments, resource allocation 
was accomplished by instructing to focus or divide attention. Alternatively, 
the experimenter may overtly control the subject's allocation of attention. 
For example, Shaw and Bates (Note 2) tested subjects in a visual search 
experiment in which only two fixations were allowed in -earching for a 
target. When subjects were asked to optimize a whereabouts response, they 
allocated all their fixations to one area and guessed the ignored area when 
a target was not detected. In the present- experiments a similar overt control 
of attentional allocation cculd have been achieved by not displaying the 
ignored diagonal, but still allowing it as a whereabouts choice. Other 
constraints on the application of our model predictions are: 1) stimuli 
must be classifiable into one of two types (e.g. target and nontarget); 
2) it must be possible to experimentally approximate equal detectability of 
each stimulus type for all sources of information; and 3) it must be possible 
to put multiple sources of information into two groups and increase the 
division of attention by increasing the number of subcomponents in each 
group. In our experiment this last constraint was satisfied by defining 
the areas to be attended as the diagonals of a square. This had another 
advantage over the use of only two positions in space. In focusing attention 
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on a diagonal there is no advantage to fixating at the upper versus lower 
position as would be the case if subjects were instructed to focus atten- 
tion on an upper versus lower position. 

The classes of two-state and continuous state attention models we have 
considered share two basic types of assumptions. First is the limitation on 
capacity and its conservation over different allocations. Second is the 
assumption that subjects make separate decision for each location and then 
pool these decisions for the "y e8 ~ n o H response. Thus, in each model the 
information about each location is coded categorically. The models differ in 
whether the criterion applied to internal representations can be varied: 
for the two-state model the category boundary or criterion is fixed while 
for the continuous state model it ^an vary. Thus, the two-state models are 

applicable whenever the stimulus domain seems to have relatively fixed 

ft 

category boundaries while continuous state models are applicable whenever 
category boundaries in a stimulus domain are variable. From this view the 
results of the present experiment suggest that letters seen under the 
degraded viewing conditions typical of tachistoscopic studies invoke 
representations the interpretations of which can be as variable and subject 
to motivational factors as, for example, an auditory signal embedded in noise. 
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Footnotes 
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TABLE 1 

RESPONSE PROBABILITIES 
Two Locations 

Positive Diagonal 



Probability of a 
Probability of a "No" Response correct whereabouts 



SUBJECT 


loo 






111 


Upper 


Lower 


1 


.90(392) 


.09(256) 


.12(252) 


.01(106) 


.96(256) 


.95(252) 


2 


.85(419) 


.16(264) 


.15(263) 


.03(112) 


.95(264) 


.96(263) 


3 


.89(417) 


.11(266) 


. .9(266) 


.08(112) 


.96(266) 


.90(266) 


4 


.85(410) 


.17(260) 


.17(260) 


.04(104) 


.95(260) 


.94(260) 



Negative Diagonal 

Probability of a 
Probability of a "No" Response correct whereabouts 



JJECT 


loo 




loi 


111 


Upper 


Lower 


1 


.88(372) 


.18(251) 


.11(250) 


.03(107) 


.91(251) 


.96(250) 


2 


.89(418) 


.19(265) 


.11(265) 


.05(112) 


.97(265) 


.93(265) 


3 


.87(418) 


.08(266) 


.11(266) 


.04(112) 


.93(266) 


.97(266) 


4 


.86(418) 


.19(266) 


.18(266) 


.05(112) 


.94(266) 


.95(266) 
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TABLE 1 (Cont'd) 

Four Locations 
Divided Attention 



SUBJECT 



1 

2 
3 
4 



Probability of a "No" Response 
One Target Present* 



No Target 



.810(509) 
.750(528) 
.770(533) 
.760(534) 



Positive 
Diagonal 

(one target 

.160(341) 
.225(357) 
.250(359) 
.241(360) 



Negative 
Diagonal 

present) 

.240(340) 
.140(357) 
.270(360) 
.260(360) 



Probability of a Correct 
Whereabouts Response 



Positive 
Diagonal 



.870(341; 
.870(357) 
.880(359) 
.860(360) 



Negative 
Diagonal 



.800(340) 
.870(357) 
.845(360) 
.851(360) 



Focused 



Attention 



SUBJECT 

1 

2 
3 
4 



Probability of a "No" Response 
One Target Present 



No Target 

.890(480) 
.870(475) 
.830(480) 
.850(470) 



Positive 
Diagonal 

.185(319) 
.150(318) 
.300(319) 
.165(310) 



Negative 
Diagonal 

.570(319) 
.875(318) 
.560(315) 
.880(309) 



Probability of a Correct 
Whereabouts Response 



Positive 
Diagonal 

.850(319) 
.780(318) 
.750(319) 
.801(310) 



Negative 
Diagonal 

.840(319) 
.885(318) 
.890(316) 
.825(309) 



Focused 



Attention 



SUBJECT 

1 
2 
3 
4 



Probability of a "No" Response 
One Target Present 



No Target 

.870(480) 
.850(452) 
.880(480) 
.865(470) 



Positive 
Diagonal 

.500(320) 
.870(318) 
.705(319) 
.875(309) 



Negative 
Diagonal 

.235(318)' 
.170(318) 
.235(320) 
.165(310) 



Probability of a Correct 
Whereabouts Response 



Positive 
Diagonal 

.835(320) 
.905(318) 
.890(319) 
.860(309) 



Negative 
Diagonal 

.830(318) 
.820(318) 
.755(320) 
.850(310) 



0 **here Is very little data for cases In which two targets were present so we have 
ERIC ot ^c 1 *"!** these figures. 
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TABLE 2 

Standard Normal Scores* for 
Teste of the Independent Decisons Model 

Two Locations 



Positive Negative 
Diagonal Diagonal 



SUBJECT 

i, 

1 .18 -.48 

2 -.11 1.625 

3 -3.17 -2.56 

4 -.31 -.51 

*7he prediction of the independent decisions model is that 

InP^ - lnP(X u <e i ) + lnPd^y. 

This means P^ + *,j 0 - P 1Q + P Q1 . The teet statistic we use is 

loP a * «f InP,, - InP.A - InP*, 

2 m oo u io oi 



S 

/ 



where 

S 
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TABLE 3 



Three Tests of Whether Subjects 
Ignored the Unattended Diagonal 



TEST 1 



SUBJECT 

1 
2 
3 
4 



ONE DIAGONAL 

Pdiols^) 



.890(764) 
.870(837) 
.880(835) 
.855(828) 



TWO DIAGONALS 
(FOCUSED ATTENTION) 



P(NO|S 



attended diagonal and 
Sqq In attended diagonal) 



01 



in un- 



.535(639) 
.872(636) 
.630(635) 
.877(618) 



Ik. 89 

- .11 
11.32 

- .92 



TEST 2 

ONE DIAGONAL TWO DIAGONALS 

(FOCUSED ATTENTION) 



P(NO|S 00 ) P(NO|S 00 in attended 

and unattended diagonal) Z 

SUBJECT 

1 .890(764) .880(960) .6k 

2 .870(837) .870(927) .02 

3 .880(835) .850(960) 1.86 

4 .855(828) .860(940) - .27 



SUBJECT 



ONE DIAGONAL 



P<NO|S 10 , S Q1 ) 



TEST 3 



TWO DIAGONALS 
(FOCUSED ATTENTION) 

P(N0|S, C , S Q1 in attended 
diagonal and S. Q in un- 
attended diagonal) 



ERIC 



i 

2 
3 
4 



.125(1009) 
.175(1057) 
.123(1064) 
.178(1052) 
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.270(637) 
.160(636) 
.268(639) 
.165(620) 
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T>BLB 4 

CONTINUOUS STATE MODEL 
Examples from Monte Carlo Study 

PARAMETERS PROBABILITY OF CORRECT WHEREABOUTS 



V 


£ 1 


H 


♦ 


P(CW/DS) 


P(CW/WS) 


2 


1 1 
X x 


2 




077 


Ana 

#977 










e JrXU 


.891 






4 


8 


942 


• SIS 






6 


12 


.915 1 


.888 






8 


8 


.711 


.693 






8 


16 


.893 


.866 






8 


32 


.986 


.969 


1 


1 s 


4 


4 


.552 


.540 






4 


8 


.678 


.660 






4 


16 


.883 


.799 






4 


32 


.942 


.919 






4 


64 


.993 


.984 


1 


1 1 


2 


4 


.841 


.739 






4 


4 


.552 


.528 


1 


1 .5 


2 


4 


.840 


! .840 






4 


4 


.550 


.540 






6 


10 


.540 


1 .540 


1 


1 .75 


2 


4 


.841 


.812 






4 


4 


.552 


.536 






6 


10 


.546 


.527 
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Appendix 1 

TWO STATE MODEL 
Notation 

Symbol 

L. Location k. 



Internal Representation (code) of the stimulus in where 
i » 0 for nontarget and i ■ 1 for a target. 

Sjj Stimulus Pattern: i 9 j ■ 0,1 depending on whether non- 

target or target in L^. 

P(L^) Probability the target appears in L^. 

♦ The total amount of attention, capacity, processing resources 

or search effort. 
$ k The amount of attention allocated to L^ 9 

b(4> k ) P ^ X ki " 1; V* thc P robablllt 7 of a detect state, D k , 

given a target in and 4> k i* allocated there. 

q($ k ) P ^ X ko " 0; *k* the P robabilit y of * nondetect state, I> k , 

given a nontarget in 1^ and <fr k is allocated there. 

CW Correct whereabouts response. 

CD Correct detection response. 

R^ Choice of in whereabouts task. 

FA Focused attention* 

DA Divided attention. 
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Representation Assumptions 

1. ♦ £ Total capacity is positive* 

2. * * £0^ Capacity is *Mitive or conserved* 

3. right continuous, concave and increasing* 
^(^) is right continuous , concave and increasing* 

In our application of this nod el we have chosen an empirical situation 
for which it makes sense to assume that b($ k ) ■ iWfc) for a11 ^k v Ia othftr 
applications it might make sans* to assume that alvaya starts out with 
tha valua saro and with some probability changes to the value 1. In other 
applications there might be reason to assume that <l(^) < D (^) or that 
h(^ k ) > <l(* k ) »11 \> It ia also possible that the relation between 
b($ v ) and q($.) changes with . 
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PROBABILITY OF A CORRECT WHEREABOUTS SEARCH 



0 
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Hert ve treat only the case of two locations. In our experiment these 
are the positive and negative diagonals. 

Focused Attention , This is also called the whereabouts strategy: attend 
to one diagonal and if fail to detect the target, chen guess the unattended 
diagonal. It follows that 

P(CW/FA) - PO^qCt) + F(Lj)b(«) 
where Lj is the attended diagcnal and is the ignored diagonal. For conven- 
ience we assume FCL^^FCL^). The two possible whereabouts strategies are 
obvious and, for each, the probability of a correct whereabouts response are 
given by 

1. Attend to L 2 : 

P(CW/FA) - P(L 1 )q(#) + P(L 2 )b(*); 

2. Attend to L^: 

PtCW/FA) - PO^bC*) + P(L 2 )q(*). 
What is the better strategy? If either P^) - P(I* 2 ) or q($) - b(4>) it 
does not matter and as we show later the whereabouts (FA) strategy gives a 
higher P(CW) than the divided attention strategy. If we assume P(L 1 )2lP(L 2 ) 9 
then which location should be ignored depends on the relation between q(t) 
and bt*). 

Divided Attention * Here the strategy is to divide attention among the 
locations in order to maximize the probability of a correct detection response 
and then choose the location most likely to contain the target (i.e., has the 
highest posteriori probability for the target given the pattern of detect 
states and the amount of attention allocated to each location. Choosing 
the location with the greatest posterior probability is the best strategy 
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given one must divide attention (i.e can't use the whereabouts strategy) and 
then try to maximize P(CW/DA); therefore, it ic sufficient to show that the 
best P(CW/DA) is always smaller than the best P(CW/FA). 

Tlie probability of a correct whereabouts response given divided attention 

is 

P(CW/DA) - PCrJl^PO^ + P(R 2 I L 2 )P(L 2 ) 
Now, | L^) must be decomposed into four components: 

PO^ I % " 0, Xjq - 0) P^ - 0, x 20 - Ol^), 

p(R x l^-i, x 20 - 0) P^ - 1, x 20 - ol^), 

P(R X 1^ - 1, X^ - 1) PCJ^ - 1, X 2Q - ljl^), 

P(R 1 |x n - 0, X 2Q - 1) P(X n - C, X 2Q - ijLj). 

The same applies, of course, to P(R 2 jl^). For the sake of simplifying nota- 
tion we shorten X^ - 0 to D fc and ^ ■ 1 to D^. 

Given a pattern of detect and nondetect states over locations, the 
choice of a location for the whereabouts response should maximize the possibi- 
lity of a correct choice. The overall probability of a correct whereabouts 
response will then be a mixture of strategies that depend upon these patterns 
of detect ant* nondetect states. Thus, we first consider these patterns of 
detect states and the corresponding optimal choice- In all the cases below 
it is assumed that the optimal allocation plan (<J>) for detection is used 
and there is sufficient # that this plan results in some positive allocation 
to each location. If there is not sufficient *, then the best choice of a 
location given failure to detect in the high probability location, L., is 
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still the high probability location, 1^. This follows because under the 
optimal detection plan * is allocated entirely to Lj and this means 

PO^) [l-b(»)]>P(L 2 ). 

In other words, the conditional posterior probability for 1^ is still higher 
than the prior probability for Lj. (See Stone, 1975). 

We now consider one by one the optimal choice for each pattern of detect 

atates. 

CASE 1 . D x D 2 (X li - 1 and ^ - 0) 

(e) POjDjDj) - h(» 1 )q(» 2 )g(4) 

P(DjD 2 ) 

(b) r(L 2 | Dl D 2 ) - [ l-qc^ici-bcypcy 

The optimal strategy is to choose ^ if PO-jD^ > POjl Djl> 2 ); other- 
wise, choose L 2< We consider the case where PO^) ■> P(L 2 ). The optimal 
evocation of * is <fr. • $ 2 so we have the decision rule choose 1^ if 

b(»/2)q(»/2) >(l-b(»/2)][l-q(»/2)] t 

or when 

b(»/2) > l-q(»/2). 
If b($) - q(0) then the decision rule reduces to choosing if 

b(*/2) > 1/2 . 
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CASE 2 . D x D 2 (X ±J - 0 and X^ - 1) 

(a) POJd^) - [l-b(» 1 )3[l-q(» 2 )]P(I 1 ) 

P^Dj) 

(b) pa^D^) - g^CM^)]?^) 

PCD^) 

The optimal strategy la to choose L 2 If 

q^^b^^Pa^^El-b^^DCl-q^^JPt^); 

otherwise choose L,. We consider the caae where P(L 1 ) - P(L 2 ) and ao 
- ^ optimal choice of a location la If 

q(*/2)b(*/2) > [l-b(*/2)][l-q(t/2)] 

or when 

b(*/2) > l-q(*/2), 

If b($) - q($) then the decision rule reduces to choosing if 
b<*/2) > 1/2. 



CASE 



2. D,D 2 (X 1± - 1 and X^ - 1) 



(a) P^jD^ - b( ( ) )l )[ l-q^flPq^) 

P^Dj) 

(b) PO^D^) - [l-q^jr b^)]?^) 

P^D^ 
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The optimal strategy is to choose if 

CbW 1 )][l-q(* 2 )]P(L 1 )>a-q(* 1 )][b(« 2 )]p(L 2 ). 

We first consider the case where P(Lj) - P(L 2 ) so that 9^ - $ 2 * 
optimal choice of a location is if 

b(*/2)[l-q(*/2)] > [l-q<*/2) ] b(»/2). 

But these are equal so the choice is arbitrary. 

CASE 4 . D x D 2 (Xj^ - 0 and 1^ - 0) 

(a) POJT^Dj) - [l-b((^ 1 )]q((> 2 )P(L 1 ) 

P(D X D 2 ) 

(b) P( T - 2 |D 1 D 2 ) - [qW^JCl-MtyJPOj) 

P(D X D 2 ) 

The optimal strategy is to choose l> 2 if 
P(L 2 )q(* 1 )[l-b((> 2 )]>^l-b((> 1 )]q((> 2 )P(L 1 ). 

Here we consider the case where P(L^) • P(L 2 ) and so ^ - $ 2 « The optimal 
choice is L 2 if 

q(*/2)[l-b(*/2)]>fl-b<»/2)]q<»/2). 
But these are equal so the choice is arbitrary. 

We now derive an expression for P(CW/DS) when P(L^) • P(L 2 ) and the 
optimal response selection rule is used for the case where b($)>l-q($). 
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P(CW/DS) - POj/L^POj) + P(L2/L 2 )P(L 2 ) 
- P(D 1 /L 1 )P(D 2 /L 1 )P(L 1 ) 
+ P(D 1 /J, 1 )P(D 2 /L 1 )P(L 1 ) 
+ P(D 1 /L 2 )P(D 2 /L 2 )P(L 2 ) 
+ P(D 1 /l 2 )P(D 2 /l 2 )P(J ;2 ) 
Substituting the model expressions in the equation above, we have 
P(CW/DS) - b(* 1 )q((}i 2 )Pa i ) 

+ M^Hl-qO^lPOj) 

+ q(* 1 )b(<j. 2 )pa 2 : 

+ q(* 1 )[l-b(* 2 )]P(L 2 ) 

- b(*/2)P(L 1 ) + q(*/2)P(L 2 ) 

- ,5[b(*/2) + q(*/2)] 
Comparison of P(CW/DA) and P(CW/FA) . 

P(CW/FA) » .5[b(*) + q(*)] and 
P(CW/DA) - .5[b(*/2) + q(*/2)] 

Since b(*) >b(*/2) and q (*) >q(*/2) it follows that 
.5[b(*) + q(*)]>.5[b(*/2) + q(*/2)] or 
PCCW/FA) > P(CW/DA) 
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Tha ravarse prediction obtains, however, If b($)<l-q($) : 
that la If conditional probability of a detect state given the target la 
present Is smaller than the probability of this state when the target la 
not present. We leave this prcof to the reader. 
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CONTINUOUS STATE MODEL 
Additional Notation 



Symbol Description 

^(X), f d (x) The probability density function for target and 

distractor random variables 9 respectively. 
The mean of the target and distractor random 
variable , respectively* 



*V *n 



Representation Assumptions 

1, $>o Total capacity is positive 

2. * - E$ k Capacity is additive (conserved under partitions of it) 

3 0 2 . 2_ o 2 is the variance inherent in the stimulus representa- 

tlon* 

4. f (X) - f (X+C) The probability distribution of X^ is a shift of the 
4 t 

probsbility distribution of X^. 

5# f f t are symmetric, unimodal distributions with finite means 

d t 

and variances. 

For convenience we assume thst u n - 0. The assumptions that fj, f fc 
ate symmetric, unimodal with the same variance inherent in the stimulus 
representation are analogous to the assumption that b(<J> k ) - q($ k ) in the two- 
state model. 
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Two State Versus Continuous state Models 
PROBABILITY OF A CORRECT WHEREABOUTS SEARCH 
We first treat the case of two locations and then the case of n Iocs- 

tions. 

TWO LOCATIONS 

Focused Attention (whereabouts strategy). The subject is instructed 
to attend to on. diagonal and if the t.rget is not detected, then guess the 
unattended diagonal. For the continuous state model we assume that the 
subject picks a criterion ^ for the attended location and if \ i <$ k 
then the unattended location is .elected for the whereabouts response. Thus, 
if 1^ ia the attended location 

p(cw/fa) - P(x k0 <B k )Li-p(i lt )] 
+ p ( x ki > e k )p(i lt ). 

We denote by G(Z) the stsndardised cumulative distribution function and then 
writ* 



p(cw/f« - c P*j^]d - Ftyj 



1 - G 



It is ...umed that P^) - l-P^) Md ^ U optlnal ^ . £ wln 

the probability of a correct detection at the attended location), then it 
follows that 



P(CW/FA) 



0 
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Two State Versus Continuous State Models 

Divided Attention (detection strategy). The subject is instructed to 
attend equally to both locations (diagonals) and choose one location (diagonal) 
most likely to contain the target given the impressions from the brief flash. 
In the model we assume that the subject chooses the location with the largest 
random variable. Thus, 

P(CW/DA) - P(L 1 )P(X 2Q ~ X 11 < 0) 

+ P(L 2 )P(X 1Q - X n <0). 

Again, letting G denote the standardized distribution function, the equation 
above can be rewritten as follows: 



P(CW/DA) " P 




If it is assumed (t^ - <t> 2 because P(Lj) - P(L 2 ), then we have 

PU^/DA) - G y 2(? / * 
Comparing this final form for P(CW/DA) and the final form of 1(CW/FA) we see 
P(CW/DA) - P(CW/FA). If subjects use a nonoptimal 3 in the focused attention 
condition we will have P(CW/FA) < P(CW/DA). 
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Two State Versus Continuous State Models 
N LOCATIONS 

In the case of N locations we have Monte Carlo results for the 
Gaussian probability function and the assumption that PO^) - P(Vj) for 
all k and j. 

Whereabout s Strategy . Here one location 1^ Is chosen to be Ignored; 
if for all lfk, < f^, then Is chosen for the whereabouts responses. 
If however, there are some such that ^ >& v then L Is chosen such 
that L r • max {X u >. The probability of a correct choice under this 
strategy Is 



p(cw/ws(fa)) - p(l. ) n g ( g i \ 

1* W<V 



Detection Strategy . Again, It Is assumed all locatlona are equally 
likely to cont'.in the target. The location chosen, 1^, for the whereabouts 
response Is assumed to be the one such that - max (X^). Unlike the 
whereabouts strategy, attention may be allocated to all locations so that the 
probability of i correct detection response Is maximized. Since PO^) • 
?a x ) for all k,l this leads to ^ - (J> k for all l,k. We now write the proba- 
bility of a correct whereabouts response as a function of this detection 
strategy (DS) as 
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Two State Versus Continuous State Models 

P(CW/ DS (F*» - j «(-^-) 6 ("S^-) " ^ 

In Table Awe present some sample computations of P(CW/WS) and P(CW/DS) 
for various values of V a , 0, *. and n (number of locations). These 
computations show that dividing attention equally among the cells and then 
choosing the maximum random variable provides a higher probability of a 
correct whereabouts response than the whereabouts strategy in which a loca- 
tion is ignored. 
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Two State Versus Continuous State Models 



FIGURE CAPTIONS 



Figure 1. The relation between P 2 (probability of a correct location 
Judgment in the two-location task) and (same probability in the four 
location task) predicted by several models and data obtained by Shaw (Note 1) 
in a visual search experiment. 

Figure 2. Predictions of the Capacity Allocation Model ; the difference 
between P(CW/FA) and P(CW/DA) as a function of the difference in the false 
alarm rate for focused versus divided attention for two different values ot 
total capacity,*. f \\ 

Figure 3. The relation between (probability of a correct location 
judgment in the two-location task) and P^ (same probabilityJLa^the^four- 
location task) predicted by the boundary curve and two attention models when 
the choice is between diagonals in the four-location task. The data points 
are from four subjects in the present experiment. 

Figure 4. The relation between P(CW/DA) and P(CW/FA) for the Sample 
Size Model and the Capacity Allocation Model. In general, the class of 
continuous state models predicts data on or below the diagonal line, while 
the class of two-state models predicts data above the diagonal line. The 
data points are from the four subjects in the present experiment. 
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AUTOMATISM AND ATTEN T IONAL PROCESSES 
Lake Wilderness Attrition Conference 

Richard Shiffrin 

I'm going to ure the opportunity today to engage in some general speculations 
about the role of automation and attention processes. I'm going to review some 
old work, discuss some ways in which the notions of automation help explain 
traditional findings in attention, and present some new work. 

To begin with, it's necessary to define the notions of automation and controlled 
processing. This gets into some very difficult issues immediately, because I 
don't think anyone has yet managed to come up with necessary and sufficient 
conditions to define automation or distinguish it from controlled processing 
(although everyone seas comfortable in using these terms). Among others here, 
I have proposed a dichotomous model in which automation and controlled pro- 
cessing are qualitatively different processes, lie put forward first a capacity 
definition: "any process that does not use general processing resources and does 
not decrease the general processing capacity available for other processes is 
automatic. " It seems reasonable that any such process should be automatic. The 
second definition is also a useful condition: "any process that demands resources 
in response to external stimulus input, regardless of the subject's attanpts to 
ignore the distraction, is automatic." This is a rather strong condition; one 
might wish to weaken it by adding an attentional condition that says that a 
process is automatic if it occurs when the subject is not attending to it (or 
trying to initiate it), as opposed to trying to stop it. That is, one can easily 
imagine processes that the subject can Successfully stop but are automatic in 
the sense that they take place when the subject is not attending. A third 
possible criterion involves learning. Most automatic processes exhibit some 
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sort of gradual and continued improvements in performance over long periods of 
practice, and the process of automatization might be defined by this character- 
istic. 

There are numerous problems in trying to use any of these definitions to estab- 
lish a difference between automatic and controlled processes in practice. Some 
of the difficulties arise because almost any task is carried out with a mixture 
of controlled and automatic processing, both in the sense that each type of 
process can initiate the other, and also in the sense that both can be carried 
out simultaneously. Furthermore, these difficulties are increased by the fact 
that the automatic component shows some sort of continuity as a function of 
practice during the gradual course of learning. 

In spite of these theoretical and practical difficulties, which I don't want to 
go into today, it is often possible to determine, the automatic and controlled 
components in the context of a specific experimental paradigm, one such para- 
digm was used a few years back by Wally Schneider and myself. It is probably 
worth mentioning very briefly a couple of those results. The basic idea is that 
a search task is carried out by the subject. Some memory set items to be searched 
for are presented at the beginning of a trial, then a series of characters 
appear in a frame. The subject says whether any of these memory characters 
appear in the frame, yes or no. There's two main conditions of interest, varied 
mappings (VM) and consistent mappings (CM). In the varied mapping condition, 
over trials the targets and distractors trade roles from trial to trial; targets 
in one trial are distractors in another and vice versa. In this condition, very 
little can be learned. In the consistent mapping conditions, the targets and 
distractors remain fixed over long periods of trials; the targets are always 
targets, the distractors are always distractors. Following are the typical 
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results. These are the reaction time results when the subject is presented a 
single frame and is asked to respond as quickly as possible, yes or no. The 
dashed lines are the consistent mapping conditions in which you see that there's 
very little effect of load. However many items the subject is searching for, or 
however many items there are in the display affect the reaction time very little. 
The subject is clearly doing something very parallel and independent of the load 
in the CM condition. However, the varied mapping condition gives rise to the 
traditional search results: a more or less linear increase in reaction time as 
a function of load. 



Very similar results occ - in the case of the multiple frame procedure in which 
accuracy is the measure rather than reaction time, in the multiple frame proce- 
dure, something like 20 frames in a row are presented rapidly and the subject is 
asked to say whether there is a target anywhere in that series of frames. Now 
the question is: does the subject have timtt to find the target (rather than how 
long does it take to find the target). Thuj we examine the probability .of finding 
the target (instead of the reaction time). Otherwise the situation is the same, 
and very similar results come out. I won't go over these in detail, but the 
basic idea is that in the varied mapping condition, the frame time which allows 
a high probability correct is very large. 800 milliseconds per each frame is 
needed if there's a high load, (16) to produce 70% correct. On the other hand, 
in the consistent mapping conditions, these times per frame are all very fast, 
and load doesn't matter very much. 



We were able to fit the varied mapping results in both the multiple frame and 
single frame conditions reasonably well with the serial terminating comparison 
process model similar to Sternberg's except that termination is assuned rather 
than exhaustion of *he search. What I'm going to say a little more about today, 
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is what's happening in the consistent napping conditions where we assume some 
sort of autosuitic detection is occurring. The basic idea that Schneider and I 
proposed was that sone sort of attention process was occurring in the consistent 
mapping condition such that the target items, were attracting attention to 
themselves, and thereby bypassing the necessity for a serial search. 

One source of evidence for this model arises from an attention study. The sub- 
ject is given a relevant diagonal on which he carries out a varied mapping 
search, let's say for letters in letters (letter targets and letter back- 
grounds); there's an irrelevant diagonal in which letters are appearing but 
they're to be ignored all the time. As a series of multiple frames are presented, 
let's say the relevant tsr<?*t is "F". Suppose "3" is an old learned consistent 
mapping target that has been trained previously. Suppose the "3" occurs in the 
same frame as the target that's to be detected, the "F". We compare this case 
with the case when the old trained target, the "3", occurs on the irrelevant 
diagonal in the frame prior to, or the frame subsequent to, the actual target 
to be detected. Then the question is, "Will the presence of this previously 
trained item, even if it's in a "to be ignored" location, hinder the detection 
of the target on the relevant diagonal, even though the subject knows what to 
attend to and is supposed to ignore the other diagonal?" The results are as 
follows: When the relevant target is in the same frame as automatic item, it 
harms detection considerably; it drops detection accuracy by about 20% (which 
doesn't happen when the CM item is before or after the relevant target). The 
explanation holds that the CM target is calling attention to itself, so that the 
attention system orients away from the relevant target which is missed as a 
consequence. 
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Question: Does your subject know in advance, he might get a situation like 
this? 

Answers !fes, these numbers are occurring somewhere on every trial, in some 
frame* They're always to be ignored, so the subject knows that they're occurring, 
and he knows to ignore them; it's not a surprise* They're occurring regularly* 

Sometimes they're ir the same frame as the relevant target, sometimes they are 
somewhere else entirely* But he knows they're there, and he knows he's supposed 
to ignore them* 

Question: Is the attention attracting capability lost as your experiment pro- 
gresses? 

Answer* Mot over the course of the session or two over which we ran this exper- 
iment* Of course, if we ran this study long enough, the subjects would adapt 
eventually* That is, the tendency for this item to attract attention would 
undoubtedly eventually go away, but we didn't run long enough to have that. 

There are some other relevant results concerning the learning of an attention 
attracting response* One somewhat peripheral finding, but worth pointing out, 
is a study that Wally Schneider carried out* The results L just showed you 
indicated a deficit in detection when an automatic target occurred at the same 
time as a relevant VM target* Schneider's results occur when the subject is 
asked to carry out two tasks at the same time, one VM and one CM* One search 
task is automatic (on one diagonal) and on the other diagonal the subject is 
asked to carry out a VM search* But the two targets do not occur at the same 
time* The subject is simply required on every trial to find any target that 
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occurs (either might occur but only one does ). The results show that the subject 
is able to carry out both search tasks as well as when control conditions are 
used in which only one task is required. This shows that the subject is able to 
carry out both tasks together, as well as either alone. This result only occurs 
when the subject is told to focus all his attention on the VM task. If the 
subject is not told what to do, or is asked to divide his attention between the 
two diagonals, he does as well on the CM task as before (although his response 
bias changes) but because he's withdrawing attention from the controlled, the VM 
task suffers greatly. 

This result shows you can carry out VM and CM tasks at the same time. It 
indicates that at least one of these tasks is not demanding capacity, so we can 
do the other one. Clearly, the one that's not demanding capacity is the auto- 
matic task, because that's the one that you don't have to pay any attention to 
by instruction to get these results. So, apparently the automatic detection 
does not require an attentional effort. 

Question: They don't need to attend to the automatic diagonal and it harms them 
if they do so? 

Answer: Yep. That's absolutely right. They don't know that they don't need to 
devote attention to the automatic diagonal. Hence, if you leave them alone, 
they will devote unnecessary attention to the automatic task, attention that is 
not really needed. It may even be harmful to the CM task to do so, but that 
needs to be explored. What is clear is that the VM task is harmed when (un- 
necessary) attention is devoted to the CM diagonal. 
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Question: Have you got those results? 



Answer: Oh yes, I've got then in my briefcase; I can show them to you later. 
There's clearly a big deficit when they're competing with each other. 

There are some reversal results from the '77 paper which are worth mentioning 
briefly for their attentional implications. The subject starts off learning one 
set of letters as targets in another set of letters as distractors in a CM 
procedure. Performance gradually improves in a multiple frame dectection task 
until asymptotic performance is reached, lit that point the targets and distr ac- 
tors are reversed. That is, if the subject has been learning A targets in B 
distractors, he's now told to search for B targets in A distractors. The two 
sets are simply reversed. What happens is a severe decrement. The subject not 
only is not transferring, but has negative transfer. The subject drops below 
the original level of performance seen at the start of training. So it's actually 
harmful to reverse after learning. One might imagine that if the targets are 
attracting attention, that they will continue to do so after the reversal and 
pull attention away from the new targets, which are the old distractors* It's 
important to realize that in this study, there were only two characters in each 
frame. Thus after reversal, there was only one new target and one "old" target 
in the target frame. In many of the studies where we get this negative effect 

r 

of reversal, we find that occurs when in the target frame there's just one of 
these old trained targets. If there's many old trained targets in the target 
frame, we don't get this result. That ie, performance just returns to the VM 
level after reversal. Perhaps hen there are many targets all attracting atten- 
tion simultaneously, attention doesn't go anywhere. At least that's a possibil- 
ity. But in this case, we do get a negative effect, again interpre table in 
terms of attracting attention. 




One \ast result, again with respect to this whole issue of attention attraction, 
is a study Sue D una is carried out on transfer of training. She used a transfer 
of training design that is basically fairly simple. The subject is given CM 
training on & targets with B dis tractors and C targets with D distractors. The 
subject learns these well. At the ssm tine they're trained on VM tasks using 
other characters. After transfer, the targets can be kept the same by keeping 
the old CM targets and using VM background items. Alternatively, the old back- 
ground items can be kept the same, and new VM items can be used as targets. 



The basic question is: What is being learned in original CM training? The 
targets? The distractors? Both? 

A reaction time task was used. These bars show the difference in reaction time 
between frame sizes of 16 and frame sizes of 4. in effect, these are the slopes 
of the search functions. A large slope occurred in initial VM framing, and a 
small slope occurred in original CM training. The results after training can 
be summarized by saying that both target and dis tractor transfer was excellent. 
That is, whether the targets were kept the same, or whether the background items 
were kept the same, very good transfer occurred, about as good as a continuation 
of CM training. 



Question: Why are the CM slopes not zero? 



I'm going to come back to this and say a few words about it later. We find, in 
much of Sue Duma is* work, that CM subjects show a slope that wasn't present in 
the original Shiffrin and Schneider results. The subjects that Schneider and I 
ran originally, did not use what I'd call a great deal of effort in the CM 
conditions. If you're thinking of effort in Kahnsman's old terms, let's say. 



That is, they ran in pairs and thay talked with aach othar, discussed the events 
of the day, world affairs, their homework, and other such things while they were 
carrying out the automatic detection. They showed this flat reaction tine 
curve. Sue Dumais 1 subjects seemed to be a little bit more motivated. They 
were trying fairly hard in the Ul conditions, even though it was an easy task 
for then. We think that, although it wasn't needed, they were carrying out 
controlled search in addition to the automatic detection that was all they 
actually needed* As a result, by mixing these two kinds of search, they pro- 
duced non-zero slopes. Perhaps on some trials they were using controlled 
search, and on some trials they were using the automatic detection. By mixing 
the two, you get a Mall slope with a two to one slope ratio, as opposed to a 
flat curve. We think that's what's happening. 

Question: were they as well practiced as in the earlier studies? 
Answer: Yeah. They were practiced quite well, certainly to asymptote. 
Question: Why does mixing produce linear RT functions? 

Answer: Well, it depends how you mix. There's a lot of different ways to do 
this. A number of different models can be proposed, and this gets a little 
technical, if you mix on a random proportion of trials then these linear results 
will be predicted, other kinds of mixing are a little more problematical. For 
example, if they're competing in parallel, and the fastest wins, things get 
complicated, and then it gets much harder to find a version that will predict 
these results. There's a number of different versions of models that have to be 
considered in that respect. There are some models of a mor* complicated kind 
that still handle these results, but it's much more difficult to find them. The 
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•aiitit thing \ to do is to imagine that the subjects, for one reason or another, 
are randomly using one or the other, on some proportion of trials • Perhaps they 
try both, but some random event determines which gets used. Whether that's a 
good assumption or not, I don't know. We're carrying out, incidently, a test to 
find out if it indeed is the case t£at subjects are mixing controlled search 
into the CM conditions. We're carrying out an experiment in which we give them 
a subsidiary task to do that will take their attention and cause them to devote 
it to something else, thereby removing it from this task and hopefully removing 
this slope. We don't know the results yet. 



How do we explain the transfer results? Well, the model we have in mind is the 
same sort of model I've been proposing in which attention attraction is learned 
to these different items. The notion is that during training attention re- 
sponges will become attached to items* The CM targets get very strong tenden- 
cies to attract attention, and the dis tractors become very weak in their ten- 
dency to attract attention, relative to the neutral amounts of tendency to 
attract attention that VM items get. Then in transfer you can explain the good 
performance for either background transfer or target transfer by the fact that 
one of these items will have a stronger tendency to attract the attention than 
the background. Whether you keep the background the same and put new VM items 
in as targets, or keep the targets the same, and new background items in as 
dis tractors, one of these will stand out with respect to the others because it 
will have a stronger tendency to attract attention. That explains the good 
transfer results. You might ask, "Do we know that VM items have an intermediate 
tendency to attract attention?" The an3wer is yes. Sue Dumais carried out 
another series of conditions which show the same transfer results if, instead of 
VM Items in these transfer tasks, totally novel items are used. That is, new 
items the subjects haven't seen before are equivalent to VM items given much 
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training* That indicates that the amount of tendency to attract attention that 
the VM items have is about the same as novel items have* That is, the result of 
VM training is to leave the tendency to attract attention at a constant level. 



In suamary, th s whole picture looks fairly simple and fairly consistent. The 
items that are trained in a consistent manner as targets are developing a ten- 
dency to attract attention. The items that are trained as distractors are going 
the opposite way: they're developing a tendency not to attract attention. This 
picture looks fairly straightforward, and fairly simple. It doesn't explain 
everything, but as a sunmary result, it presents a fairly simple and accurate 
picture. 



Let me just say a few words about the implications of these kinds of results for 
general attention rode Is and processes. We would want to argue that the bottle- 
neck that 9 8 seen in a large class of selective attention studies, that Broadbent 
originally described with a filter model, is identified with the limited serial 
comparison search process that we've been talking about in these studies. 
Subjects go through these items one at a time, comparing them, the limited, 
serial-comparison, search process may be identified with the bottleneck in the 
filter model. On the other hand, we suggest that all the stimuli are processed 
automatically to very high levels, up to perhaps meaning in some cases, so that 
processing doesn't stop at a low level. In that sense, the model is very much 
like that of say, Deutsch and Deutsch. So this is really a hybrid approach, a 
combination of the filter model, if you like, and the Deutsch & Deutsch ap- 
proach. Finally, consider the kinds of evidence that led Anne Treiaaan to 
propose her modification of the filter model, the attenuation approach. Her 
evidence mostly consisted of demonstrations that the stimuli in not-to-be-at- 
tended locations, occasionally receive processing or cause effects. To a large 
degree, the kinds of results that I've been presenting here suggest that such 
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results can be understood as the result of the "to be ignored" stimuli automa- 
tically, attracting attention for one reason or another. Such stimuli might 
have a learned tendency to attract attention (such as a person's own name in a 
to-be-ignored location) or might have an innate tendency to attract attention 
(such as an unusual color or loud noise causing a startle response) . Many of 
the results in which to-be-ignored stimuli are nevertheless given processing may 
be due to their tendency to attract attantion automatically. 



This theoretical approach uses automatism as a basis for understanding selective 
attention. The approach has the merit of combining the research in selective 
attention with that in the search domain, and can explain both at the same 
time. 

Most of the recent studies we've carried out have used above threshold stimuli, 
and one might ask "wh«it happens if you try to extend this reasoning and this 
model to near threshold stimuli?" The reason this is an important question is 
that one might suspect autoo^tian or automatic detection doesn't work very well 
for stimuli which are perceived so poorly that they are ambiguous. A number of 
yeari back, before I started the automatic work, I was involved with a series of 
threshold studies using what we called the successive-simultaneous procedure. 
Let me give just one typical example of a paradigm in this successive-simul- 
taneous procedure. In the simultaneous condition, all *:he stimuli were presented 
simultaneously, preceded and followed by masks. In the successive condition, 
the same stimuli appeared one at a tJ ~e, each preceded and followed by masks, 
and each presented for the same as they were in the simultaneous condition. (In 
some cases, we presented one diagonal at a time.) Subjects vere asked to detect 
whether an "F" or "T" was in the display. It can then be asked whether detection 
accuracy is b* tter in the successive conditions. If dividing attention among 
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simultaneous positions causes fewer features to be extracted frcm a given charac- 
ter in a given position, then the simultaneous condition should result in poorer 
performance. Well, we carried out a dozen experiments along these lines, all of 
which showed that the simultaneous condition was essentially as good as the 
successive condition (or could be if you ran the experiments the right way) we 
therefore concluded that the amount of information extracted frcm a given dis- 
play location was essentially equivalent in the simultaneous and successive 
conditions. This was evidence in favor of some sort of late selection model, 
rather than early selection model, because the information extraction was ap- 
parently equivalent in these two conditions. 



But these results still raise a rather large puzzle, which is, "how could the 
simultaneous and successive conditions give equal performance? " Even if the 
amount of information extracted from each position is the same, somewhere along 
the line, attention is going to be operating. Even in short term store, the 
subjects have to scan these stimuli, and to decide about them; since the scanning 
and deciding is known to be capacity limited, one would still expect subjects to 
do better in the sequential condition. We once suggested that short-te th store 
capacity wasn't exceeded, so that all the deciding and all the comparing that 
was needed could be carried out even in the simultaneous condition. That wasn't 
a very satisfying explanation. 

Well, these automatic and controlled results provide one possible mechanism to 
explain the equality of performance. The idea is very simple. These simul- 
taneous-successive studies were all run in CM conditions. 1.^ targets were 
always targets and they remained so. The idea then is that the subject devel- 
oped attention responses to these CM targets. The difficulty of that explanation 
is that these stimuli are at threshold. That is, these stiF- i are seen ambi- 
guously sometimes, and how can this fact give rise to automatx . detection? I 
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now propose one explanation. The idea is sijnple. Sometimea under these thresh- 
old conditions, a letter is seen clearly; in those cases, the normal automatic 
attention response occurs, as one would expect in above threshold conditions. 
Sometimes one doesn't see the whole letter, but sees a unique pattern of fea- 
tures, a pattern that still serves to uniquely identify the letter. That unique 
pattern, during training, would also develop an automatic attention response, 
just as the name or the letter does, as long as it's unique. Thus unique partial 
features also call attention to the target location. The only really ambiguous 
cases occur when a feature or features are abstracter that do not serve to 
segregate targets from distractors consistently. In those cases, the automatic 
attention system can't operate, but on the other hand, in such cases using 
controlled processing won't help either. That is, if the target is ambiguous 
and distr actors are ambiguous, it's not going to help to use control processing 
to search the display anyway. Thus automatic attention attraction can explain 
why the subject is able to exhibit equality of performance in the simultaneous 
and successive conditions. I should point out, it's very easy to test this 
argument. In fact, Dave Fogle has carried out a study at Indiana verifying this 
conjecture. He has carried out a simultaneous-successive study using both VM 
and CM training. In CM training, he found the usual equality of simultaneous and 
sequential conditions, but in VM training, he found a considerable advantage for 
successive conditions. 



One supplementary point should be mentioned*. John Duncan's results, that he 
reported and reviewed recently in Psychological Review , showed a deficit even in 
CM conditions for detecting simultaneously two targets occurring in the same 
fram„. In those cases where he has two targets presented, he finds a deficit in 
the simultaneous condition relative to the sequential condition. That'? explain- 
able in our attention terms, as follows: when two targets are both calling 
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attention to themselves, obviously both can't receive processing "first". At 
least one of them has to be delayed, in which case you're back into a situation 
where the successive condition should do better. Such reasoning can explain 
Duncan* s results. 

This argument, if accepted, does not affect the original conclusion from these 
studies, that the information extracted from a given location is the same in the 
simultaneous and successive conditions. That is, even if attention is being 
directed automatically to a given location, it's necessary that the goodness of 
the features extracted in the simultaneous and successive conditions be the 
same. If not, the attention attraction would be superior in the successive 
condition, and performance would be inferior in the successive condition. Thus 
the original conclusion we drew is still justified, but we are now interpreting 
the equality of performance in terms of a directing of attention due to an 
automatic response. 

There's a number of other things that occur automatically in the processing 
system that affect attention and selection that perhaps ought to be touched on. 
The participants in this conference are going to be mentioning various of these. 
I want to mention just one in particular which might be called "automatic ssgrega 
tion" (or partitioning or grouping); this used to be studied often by Gestalt 
psychologists. In recent research, a series of studies by Anne Treisman has 
looked at this problem in the attention domain. I am raising the issue of some 
sort of automatic partitioning of a visual display into regions on one basis or 
another. This notion is not necessarily the same as attention being directed 
automatically to given regions, but simply an automatic segregation or parti- 
tioning. 
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As an example, consider these three displays. What we see is probably an auto- 
matic tendency for you to group these letters into two regions over here in the 
left panel, a series of columns over there in the middle panel, and a single 
zero in a surround of "X's" in the right hand panel. Note that there are really 
two things that have to be separated here. One is some tendency for automatic 
grouping to occur, and the second is perhaps some tendency for attention to be 
attracted to one of the grouped regions. Perhaps attention is attracted to this 
column, in this case, or perhaps attention is attracted automatically to that 
single zero, and perhaps there's some general rule that says attention tends to 
be attracted toward the mailer region rather than the larger region. But these 
are two separate issues; one is the tendency to automatically segregate or group 
the display, and the other is the tendency for attention to be directed toward 
one of these grouped regions. These are separable issues and can be experi- ^ 
mentally studied. For example, if attention is being attracted to this isolated 
zero, one can check that by seeing if the presence of that zero harm a search 
among all .nese "x's" and if there's a distance affect such that the closer a 
target is among these "x's" to the zero, the more harm to the search would take 
place. This is a very easy experiment to carry out. For example, if I asked 
you to find a "y", or tell me whether or not a "y" is somewhere in the "x's 91 as 
in this case, one could test whether the search time would be affected by the 
proximity of the "y" to the "zero". I think it's important that such tests be 
done, although I think that not too much has been done yet in terms of trying to 
separate those two issues. 



Question: Are you saying that partitioning and grouping attract attention? 

Answer: What I'm saying is that there are two separate processes: attention 
and grouping. Both may occur automatically, but they may not be completely 
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identical. There may be a tendency sometimes for attention to be directed 
toward a region that haa been automatical ly grouped. But these are two separate 
issues, and they should be experimental ly separated so that you can distinguish 
the two. 

Question: Does training determine the grouping? 

Answer: Not in these examples. I think it's the nature of the stimuli that 
determine the grouping. This works best, of course, when you've got color 
displays, when colors are the basis for segregation. Anne Treianan has a number 
of results on that, having to do with color being very good. Shape is perhaps 
not quite as good, and you can go down the line. Regularities and irregular- 
ities in displays, oddities, have a good effect. But tM main point I'm trying 
to make is that ws should try to separate out the two aspects, the automatic 
attention attraction and the automatic grouping or segregation, both of which 
can affect attention and selection. One experiment that Sue Diana is carried out 
not too long ago, is worth pointing out, which gets at both th*se issues, and is 
worth mentioning. 

What we did was train up M A M targets and "B" distractors in a consistent mapping 
condition. After training, we presented a field which contained both targets 
and distractors. Is there a tendency as a result of the training to partition, 
segregate or group thxs field into two regions or two sections, one containing 
the targets, and one containing the distractors? Sue Dumais carried out a 
counting test to get at this. We asked the subjects to count the the number of 
designated elements in the display, ignoring the background elements. In control 
conditions we didn't give consistent training; we simply asked the subject to 
count various things, like count the "V"s in a background of upside down "V"s. 
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That was a hard test, that was always very difficult. It took a long time per 
element. It took just as long to count numbers in letters or vice versa. 
However, it was easy to count O's and X's. That was very easy. O's in x's is 
one of those things you can do automatically, somehow or another, and shows no 
slope effect of the irrelevant items. That is the number of the target 3 to be 
counted is a determinant of the reaction time, but the number of distracting 
elements has no effect. 



Then, what we do is carry out CM training of letter targets in number dis trac- 
tors. After consistent training, we do the same experiment, and ask the same 
questions. 



Let me show you some typical results here. These are abstracted from all the 
results. These conditions are a selection of those trials on which only one 
target was to be counted, so the answer was always "one" . We have graphed 
reaction time as a function of the number of irrelevant items in the displays. 
After training of letters in numbers, using the CM procedure, we see these 
results: it is quite a bit easier to count letters in numbers and only a little 
bit harder to count numbers in letters. That is even numbers in letters is 
better than performance before CM training on letters in numbers. 

We think that there's two effects going on. The reason both letters and number 
targets are helped is that there's some tendency for grouping of the display 
into two regions, targets versus distractors, which helps counting. In addition, 
there's this automatic tendency to attract attention, that causes you to do 
better on the trained targets, the letters, than the distractors, the numbers. 
We suggested that the result of training is that when you show a display of 
numbers and letters to a subject and ask him to count, his percept after training 
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will look something like this in exaggerated form, with large letters standing 
out in a background of nail numbers. This shows an accentuation of the targets 
and a diminution of the dis tractors. And we argue, it's easier to count the 
letters than it is to count the numbers, although it's the segregation that 
let's you count either set better than an homogenous set. 

There's a variety of mechanisms by which this counting could take place. Mote 
that the reaction times are very much slower than search reaction times in this 
study. These are like 150 to 200 milliseconds per count, so that they're doing 
something a lot slower than the normal serial scan (40 msec per comparison). 
They may even have time to scan over the whole display first to find all the 
targets, and then go back and count them. We don't know the exact mechanism, 
nor do we know for sure that the segregation that's occurring here is an automa- 
tic segregation or grouping, or whether Jt's a controlled segregation or grouping. 
There's enough time for the subject to carry out a search or a scan to segregate 
these displays into two halves, using some sort of controlled process if they 



want to. This is a matter for future research. Dan, do you have a question? 

Question: Is this a matter of foreground-background perception? 

Answer: Not exactly. We are using two types of characters so that in some 
sense there's a foreground and background, or something like that. I don't know 
if you want to call it that, it may be a foreground and background segregation. 
However, if we used three or four character types, then grouping may be into 
four regions with no one standing out especially, or with all of the CM trained 
characters standing out. 
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Question: Is training sufficient to produce this grouping effect? 



Answer: That's a good question: some things are very difficult to train. Anne 
Triesman has shorn that there's some things that are difficult to train, like 
conjunction of shape and color. Of course we'd have to train a long, long time, 
to be sure of what the limits are, but there's some controversy. Wally Schneider 
has some results indicating that for example, conjunction of features may be 
partially trainable, and Anne has some results that says, that she doesn't think 
so. At the Boment it's a controversial issue, and perhaps I don't want to go 
into that today. What can be trained and what can't be trained is an interest- 
ing question, and I don't have too much to say about that right now. I think a 
lot of things can be trained, probably a lot more than we suspect, but there 
probably are limits, and I don't know yet what they are. 



Question: Are your present results due to the use of letters and numbers? 



Answer: Letters and numbers were used in this case, but none of our results in 
search processing lead us to think that there's anything special about letters 
and numbers in this respect, except that they're a little faster to train. 
We've always been able to duplicate all our results with letters and numbers, 
using letters and letters, and using highly confusing sets of stimuli, it just 
takes longer to train them. We used letters and numbers in this case because we 
didn't want to spend a few months of training, but none of our previous results 
leads us to think that there is anything special about these sets. It would be 
useful to carry out CM training, let's say on letters and letters, or some con- 
f usable sets, to see what the limits would be. 
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Question: kx% the presented displays data limited? 



Answer: That depends on what you mean by data limits. Our displays are not 
data limited in the usual sense since everything is seen clearly. The subject 
can clearly see all the characters. These are quite visible, quite contrasty, 
quite clear. 

As a final conclusion, I want to say on the one hand that the notion of automa- 
tism and controlled processing is a fairly useful and valid dichotomy, and on 
the other hand that our theoretical dichotomy goes far towards explaining many 
of the mechanisms of selective attention, although obviously we're just begin- 
ning work in this whole area. 

Question: Is there a special role played by color? 

Answer: In part, that's answerable by experimentation, some of which has been 
done. I think that it undoubtedly is the case that color serves as a basis for 
discrimination, because it has some automatic properties like segregation, or 
attention attraction, or both, that will then carry selection effects with it. 
I don't think there's much question about tnat. I think that can be carried by 
innate properties like color discrimination or by trained properties. Maybe you 
could train any stimulus with enough training so it acts in the same way. 

Question: Many studies have shown color to be a "special " cue of special salience, 
different from "response" sets. 

Answer: Well, now, wit a minute Dan. This is an open experimental question. 
There is a general tendency for oddity to be able to serve as a cue in general. 

3S, 3 *« 



If you only have two stimuli, red and blue or blue and red, and you switch back 
and forth, you might be able to use automatic segregation. However, it doesn't 
even have to be reds and Itlues, say it could be x's and o's or o's and x's, you 
can switch back and forth in the VM procedure, and might still get automatic 
segregation. 

Question: What do you mean by oddity? 

Answer: I mean some kind of perceptual oddity based on the stimulus pattern. 
There's clearly an issue of what happens with a snail set of objects in a VM 
procedure. Something is very special about that, and I think that we can rede- 
fine the stimuli in those cases a little bit if we want, and still capture the 
attention attraction property, so we won't have to worry about the VM procedure. 
However, this business of color as being a special case is not at all clear. 
We're trying out an experiment with seven colors in the VM procedure. We rather 
suspect that when you increase the number of colors, we'll begin to get results 
that look more like the letter- number results. That is, in the VM procedure 
with seven colors, wh3re you're told in a given trial which color to look for, 
we suspect the results will not look like the traditional color result that you 
can select out the one color you want from the whole set, and show no slope 
effect. With seven colors, it might be necessary to change for CM to VM to get 
the two different results. That is, there's something special I think about 
small sets that's not unique to colors, that may be unique to some kind of 
oddity result. This is an issue that's a little premature to talk about, because 
we're exploring this now. All I want to say is that the result that you're 
claiming is not at all clear, and it's an experimental question and we're look- 
ing at it now. 
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Questions Would zero slopes occur with any two different characters or dimen- 
sions? 

Answer: We don't know. There's two issues here. One is the oddity question, 
and one is the VM or CM mapping, and both have to be explored. 

Question: Can subjects learn to reverse quickly after much training? 



Answer: I reserve judgment, lie tried that. We took subjects and tried reversal, 
then reversed them back, reversed them back, and kept this up for a while • I 
think we've got it to about four once. We never carried it far enough to really 
be sure what was happening. It was clearly taking some time to reverse. Whether 
it was talcing less time each time you reversed relative to some appropriate 
control and so on, well we never carried out an experiment with appropriate 
controls or carried it out far enough to be sure. It's a good question, Wally 
Schneider is studying this now. 



Question: I thought automatism and controlled processing are dichotomous. How 
then Can gradual learning occur? 

Answer: There's two separate processes. One of them at least, the automatic 
one, shows continuous change with practice. 
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The point is, there's two separate tracks, there's two separate processes. One 
operates, perhaps, by attention attraction, and one operates by the subject 
volitionally assigning attention to different items in some sort of serial 
search. Thus there are two separate tracks that the subject can use to perform 
in these sorts of situations. The attention attraction may gradually develop 
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over trials as you practice the subject, and eventually the subject will switch 
over to using it almost entirely. But, the fact that there's two separate 
processes does not mean that one can't develop with practice. 



Question: What happens when the subject switches from controlled to automatic 
processing? How does the switch occur? 

Answer: Well, there's basically two models. One model says that the controlled 
search being carried out in CM training in early trials is more or less irrele- 
vant to the eventual production of the automatic response. In this view, VM 
search being carried out is only sort of a holding pattern that simply maintains 
the conditions necessary for the eventual development of your automatic response. 
In this view, controlled search simply allows the necessary conditions to take 
place, such that automation will be developed. (We have carried out some experi- 
ments along these lines, and Schneider's carried out a few. We have studied 
things like how many targets you need to present versus how many distractors; 
how many times an item can appear as * target versus a distract or and still let 
the automatic attention response develop; how many times you need search for an 
item and find it versus not finding it. Things like this. We're trying to 
find out how fast automation develops in these cases. That's another talk I 
give. I don't really have time to go into this now. It's another issue that 
would take another hour , so I don't think it would be wise to get into it now.) 
The other view is that somehow, something the subject is doing in the controlled 
search is a necessary condition for the eventual development of automatism. 
What that is we're not sure. In either case, we're not sure yet. 



Mil 

356 



Question: Can the continuous nodal be distinguished from the dichotomous model? 

Answer: Distinguishing the continuous model from the dichotooous model is one 
of these things that's going to be a long time in being developed. Finding a 
good test for that may never prove possible. I personally think that these two 
models are going to be accepted or rejected on the basis of their general fit to 
a wide set of data, rather than any one test. I'm not at all sure that the 
continuous model versus the dichotooous model can ever be distinguished by any 
simple experimental test. I think it's simply a matter of what you find more 
convenient, and which one works better over a wide set of experiments, a general 
test would be very hard. 



If anybody has any idea on any firm tests to help distinguish those, please let 
me know. I rather doubt you can come up with one. 
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